Содержание
- 2. Machine Learning everywhere! Mobile Embedded Automotive Desktops Games Finance Etc. Image from [1]
- 3. Dream team Developer Data Scientist
- 4. Dream team – synergy way Developer Data Scientist Research Developer
- 5. Dream team – process way Developer Data Scientist Communications
- 6. Machine learning sample cases Energy efficiency prediction Intrusion detection system Image classification
- 7. Buildings Energy Efficiency ref: [2] Input attributes Relative Compactness Surface Area Wall Area etc. Outcomes Heating
- 8. Regression problem
- 9. Regression problem
- 10. Regression problem
- 11. Quality metric
- 12. Baseline model class Predictor { public: using features = std::vector ; virtual ~Predictor() {}; virtual double
- 13. Linear regression class LinregPredictor: public Predictor { public: LinregPredictor(const std::vector &); double predict(const features& feat) const
- 14. Polynomial regression
- 15. Polynomial regression class PolyPredictor: public LinregPredictor { public: using LinregPredictor::LinregPredictor; double predict(const features& feat) const override
- 16. Integration testing you always have a lot of data for testing use python model output as
- 17. Intrusion detection system input - network traffic features protocol_type connection duration src_bytes dst_bytes etc. Output normal
- 18. Classification problem
- 19. Quality metrics Receive operation characteristics (ROC) curve
- 20. Baseline model always predict most frequent class ROC area under the curve = 0.5
- 21. Logistic regression
- 22. Logistic regression easy to implement template auto sigma(T z) { return 1/(1 + std::exp(-z)); } class
- 23. Gradient boosting de facto standard universal method multiple well known C++ implementations with python bindings XGBoost
- 24. CatBoost C API and C++ wrapper own build system (ymake) class CatboostClassifier: public BinaryClassifier { public:
- 25. CatBoost ROC-AUC = 0.9999
- 26. Image classification Handwritten digits recognizer – MNIST input – gray-scale pixels 28x28 output – digit on
- 27. Multilayer perceptron Image from: [4]
- 28. Quality metrics
- 29. Multilayer perceptron auto MlpClassifier::predict_proba(const features_t& feat) const { VectorXf x{feat.size()}; auto o1 = sigmav(w1_ * x);
- 30. Convolutional networks State of the Art algorithms in image processing a lot of C++ implementation with
- 31. Tensorflow C++ API Bazel build system Hint – prebuild C API
- 32. Conclusion Don’t be fear of the ML Try simpler things first Get benefits from different languages
- 33. References Andrew Ng, Machine Learning – coursera Energy efficiency Data Set KDD Cup 1999 MNIST training
- 35. Скачать презентацию