Key Features
- Diverse real-world datasets to teach data mining techniques
- Practical and focused on real-world data mining cases, this book covers concepts such as spatial data mining, text mining, social media mining, and web mining
- Real-world case studies illustrate various data mining techniques, taking you from novice to intermediate
Book Description
The R language is a powerful open source functional programming language. At its core, R is a statistical programming language that provides impressive tools for data mining and analysis. It enables you to create high-level graphics and offers an interface to other languages. This means R is best suited to produce data and visual analytics through customization scripts and commands, instead of the typical statistical tools that provide tick boxes and drop-down menus for users.
This book explores data mining techniques and shows you how to apply different mining concepts to various statistical and data applications in a wide range of fields. We will teach you about R and its application to data mining, and give you relevant and useful information you can use to develop and improve your applications. It will help you complete complex data mining cases and guide you through handling issues you might encounter during projects.
What you will learn
- Make use of statistics and programming to learn data mining concepts and its applications
- Use R Programming to apply statistical models on data
- Create predictive models to be applied for performing classification, prediction and recommendation
- Use of various libraries available on R CRAN (comprehensive R archives network) in data mining
- Apply data management steps in handling large datasets
- Learn various data visualization libraries available in R for representing data
- Implement various dimension reduction techniques to handle large datasets
- Acquire knowledge about neural network concept drawn from computer science and its applications in data mining
About the Author
Pradeepta Mishra is a data scientist, predictive modeling expert, deep learning and machine learning practitioner, and an econometrician. He is currently leading the data science and machine learning practice for Ma Foi Analytics, Bangalore, India. Ma Foi Analytics is an advanced analytics provider for Tomorrow's Cognitive Insights Ecology, using a combination of cutting-edge artificial intelligence, proprietary big data platform, and data science expertise. He holds a patent for enhancing planogram design for the retail industry. Pradeepta has published and presented research papers at IIM Ahmedabad, India. He is a visiting faculty at various leading B-schools and regularly gives talks on data science and machine learning.
Pradeepta has spent more than 10 years in his domain and has solved various projects relating to classification, regression, pattern recognition, time series forecasting, and unstructured data analysis using text mining procedures, spanning across domains such as healthcare, insurance, retail and e-commerce, manufacturing, and so on.
If you have any questions, don't hesitate to look me up on Twitter via @mishra1_PK, I will be more than glad to help a fellow web professional wherever, whenever.
Table of Contents
- Data Manipulation Using In-built R Data
- Exploratory Data Analysis with Automobile Data
- Visualize Diamond Dataset
- Regression with Automobile Data
- Market Basket Analysis with Groceries Data
- Clustering with E-commerce Data
- Building a Retail Recommendation Engine
- Dimensionality Reduction
- Applying Neural Network to Healthcare Data