Disclaimer: This repository is not actively maintained anymore. For an up-to-date introduction to AzureML, please see the official product documentation and example notebooks.
Introduction
Welcome to the ACE-team training on Azure Machine Learning (AML) service.
The material presented here is a deep-dive which combine real-world data science scenarios with many different technologies including Azure Databricks (ADB), Azure Machine Learning (AML) Services and Azure DevOps, with the goal of creating, deploying, and maintaining end-to-end data science and AI solutions.
Anomaly Detection in structured data
-
The data scientist has been tasked to develop a predictive maintenance (PdM) solution for a large set of production machines on a manufacturing floor.
-
The data scientist was asked to create a PdM solution that is executed weekly, to develop a maintenance schedule for the next week.
-
Previous experience suggests that anomalies in the telemetry data collected on each machine are indicative of impending catastrophic failures. The data scientist was therefore asked to include anomaly detection in their solution.
-
The organization also asked for a real-time anomaly detection service, to enable immediate machine inspection before the beginning of the next maintenance cycle.
Note: Anomaly detection can also be performed on unstructured data. One example is to detect unusual behavior in videos, like a car driving on a sidewalk, or violation of safety protocols on a manufacturing floor. If you are interested in this use case, please go to this repo: https://github.com/Microsoft/MLOps_VideoAnomalyDetection
Agendas
Please go to this page to find alternative agendas around the above use-cases.
References
- Azure Databricks
- Azure Machine Learning Services
- AML SDK for Python
- Azure DevOps
- Example notebooks covering various topics
Pre-requisites
Knowledge/Skills
You will need this basic knowledge:
- Basic data science and machine learning concepts.
- Moderate skills in coding with Python and machine learning using Python.
- Familiarity with Jupyter Notebooks and/or Databricks Notebooks.
- Familiarity with Azure databricks.
- Basic skills using Git version control.
If you do not have any of the above pre-requisites, please find below links:
- To Watch: Data Science for Beginners
- To Watch: Get Started with Azure Machine Learning
- To Watch: Python for Data Science: Introduction
- To Watch: Jupyter Notebook Tutorial: Introduction, Setup, and Walkthrough
- To Do: Go to [https://notebooks.azure.com/] and create and run a Jupyter notebook with Python
- To Watch: Azure Databricks: A brief introduction
- To Read (10 mins): Git Handbook
Infrastructure
- An Azure Subscription (unless provided to you).
- If you are not provided with a managed lab environment (course invitation will specify), then follow these instructions for configuring your development environment prior to the course or if you do it on your own. You will need an Azure Subscription (unless one is provided to you). Pay particular attention to version numbers, such as the version of the Spark runtime.
Contribute
We invite everybody to contribute.
- Email us with questions an comments:
aace@microsoft.com
- Submit issues and comments in our github repository: https://github.com/Azure/LearnAI-CustomAI-Airlift/issues