Machine learning applications | ||
---|---|---|
Anomaly detection | Data mining social media (like Facebook, Twitter, LinkedIn) | Predict mortgage loan defaults |
Chatbots | Detecting objects in scenes | Natural language translation (English to Spanish, French to Japanese, etc.) |
Classifying emails as spam or not spam | Detecting patterns in data | Recommender systems (“people who bought this product also bought…”) |
Classifying news articles as sports, financial, politics, etc. | Diagnostic medicine | Self-Driving cars (more generally, autonomous vehicles) |
Computer vision and image classification | Facial recognition | Sentiment analysis (like classifying movie reviews as positive, negative or neutral) |
Credit-card fraud detection | Handwriting recognition | Spam filtering |
Customer churn prediction | Insurance fraud detection | Time series predictions like stock-price forecasting and weather forecasting |
Data compression | Intrusion detection in computer networks | Voice recognition |
Data exploration | Marketing: Divide customers into clusters |
openml.org
). LinearRegression
estimator. LinearRegression
estimator to perform multiple linear regression with the California Housing dataset that’s bundled with scikit-learn. LinearRegression
estimator, by default, uses all the numerical features in a dataset to make more sophisticated predictions than you can with a single-feature simple linear regression. TSNE
estimator) to compress the Digits dataset’s 64 features down to two for visualization purposes. PCA
estimator) to compress the Iris dataset’s four features to two for visualization purposes. Datasets bundled with scikit-learn | |
---|---|
"Toy" datasets | Real-world datasets |
Boston house prices | Olivetti faces |
Iris plants | 20 newsgroups text |
Diabetes | Labeled Faces in the Wild face recognition |
Optical recognition of handwritten digits | Forest cover types |
Linnerrud | RCV1 |
Wine recognition | Kddcup 99 |
Breast cancer Wisconsin (diagnostic) | California Housing |
©1992–2020 by Pearson Education, Inc. All Rights Reserved. This content is based on Chapter 5 of the book Intro to Python for Computer Science and Data Science: Learning to Program with AI, Big Data and the Cloud.
DISCLAIMER: The authors and publisher of this book have used their best efforts in preparing the book. These efforts include the development, research, and testing of the theories and programs to determine their effectiveness. The authors and publisher make no warranty of any kind, expressed or implied, with regard to these programs or to the documentation contained in these books. The authors and publisher shall not be liable in any event for incidental or consequential damages in connection with, or arising out of, the furnishing, performance, or use of these programs.