EBookClubs

Read Books & Download eBooks Full Online

EBookClubs

Read Books & Download eBooks Full Online

Book SQL for Data Scientists

    Book Details:
  • Author : Renee M. P. Teate
  • Publisher : John Wiley & Sons
  • Release : 2021-08-17
  • ISBN : 1119669391
  • Pages : 400 pages

Download or read book SQL for Data Scientists written by Renee M. P. Teate and published by John Wiley & Sons. This book was released on 2021-08-17 with total page 400 pages. Available in PDF, EPUB and Kindle. Book excerpt: Jump-start your career as a data scientist—learn to develop datasets for exploration, analysis, and machine learning SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis is a resource that’s dedicated to the Structured Query Language (SQL) and dataset design skills that data scientists use most. Aspiring data scientists will learn how to how to construct datasets for exploration, analysis, and machine learning. You can also discover how to approach query design and develop SQL code to extract data insights while avoiding common pitfalls. You may be one of many people who are entering the field of Data Science from a range of professions and educational backgrounds, such as business analytics, social science, physics, economics, and computer science. Like many of them, you may have conducted analyses using spreadsheets as data sources, but never retrieved and engineered datasets from a relational database using SQL, which is a programming language designed for managing databases and extracting data. This guide for data scientists differs from other instructional guides on the subject. It doesn’t cover SQL broadly. Instead, you’ll learn the subset of SQL skills that data analysts and data scientists use frequently. You’ll also gain practical advice and direction on "how to think about constructing your dataset." Gain an understanding of relational database structure, query design, and SQL syntax Develop queries to construct datasets for use in applications like interactive reports and machine learning algorithms Review strategies and approaches so you can design analytical datasets Practice your techniques with the provided database and SQL code In this book, author Renee Teate shares knowledge gained during a 15-year career working with data, in roles ranging from database developer to data analyst to data scientist. She guides you through SQL code and dataset design concepts from an industry practitioner’s perspective, moving your data scientist career forward!

Book Python for Data Science

Download or read book Python for Data Science written by Erick Thompson and published by . This book was released on 2020-10-30 with total page 266 pages. Available in PDF, EPUB and Kindle. Book excerpt:

Book Data Analytics for Absolute Beginners  a Deconstructed Guide to Data Literacy

Download or read book Data Analytics for Absolute Beginners a Deconstructed Guide to Data Literacy written by Oliver Theobald and published by . This book was released on 2019-07-21 with total page 88 pages. Available in PDF, EPUB and Kindle. Book excerpt: While exposure to data has become more or less a daily ritual for the rank-and-file knowledge worker, true understanding-treated in this book as data literacy-resides in knowing what lies behind the data. Everything from the data's source to the specific choice of input variables, algorithmic transformations, and visual representation shape the accuracy, relevance, and value of the data and mark its journey from raw data to business insight. It's also important to grasp the terminology and basic concepts of data analytics as much as it is to have the financial literacy to be successful as a decisionmaker in the business world. In this book, we make sense of data analytics without the assumption that you understand specific data science terminology or advanced programming languages to set you on your path. Topics covered in this book: Data Mining Big Data Machine Learning Alternative Data Data Management Web Scraping Regression Analysis Clustering Analysis Association Analysis Data Visualization Business Intelligence

Book Python for Data Science

    Book Details:
  • Author : Ethan Williams
  • Publisher :
  • Release : 2019-08-18
  • ISBN : 9781687159106
  • Pages : 200 pages

Download or read book Python for Data Science written by Ethan Williams and published by . This book was released on 2019-08-18 with total page 200 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book is a comprehensive guide for beginners to learn Python Programming, especially its application for Data Science. While the lessons in this book are targeted at the absolute beginner to programming, people at various levels of proficiency in Python, or any other programming languages can also learn some basics and concepts of data science. A few Python libraries are introduced, including NumPy, Pandas, Matplotlib, and Seaborn for data analysis and visualisation. To make the lessons more intuitive and relatable, practical examples and applications of each lesson are given. The reader is equally encouraged to practise the techniques via exercises, within and at the end of the relevant chapters. To help the reader get a full learning experience, there are references to relevant reading and practice materials, and the reader is encouraged to click these links and explore the possibilities they offer. It is expected that with consistency in learning and practicing the reader can master Python and the basics of its application in data science. The only limitation to the reader's progress, however, is themselves!

Book R for Data Science

    Book Details:
  • Author : Hadley Wickham
  • Publisher : "O'Reilly Media, Inc."
  • Release : 2016-12-12
  • ISBN : 1491910364
  • Pages : 521 pages

Download or read book R for Data Science written by Hadley Wickham and published by "O'Reilly Media, Inc.". This book was released on 2016-12-12 with total page 521 pages. Available in PDF, EPUB and Kindle. Book excerpt: Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data science as quickly as possible. Authors Hadley Wickham and Garrett Grolemund guide you through the steps of importing, wrangling, exploring, and modeling your data and communicating the results. You'll get a complete, big-picture understanding of the data science cycle, along with basic tools you need to manage the details. Each section of the book is paired with exercises to help you practice what you've learned along the way. You'll learn how to: Wrangle—transform your datasets into a form convenient for analysis Program—learn powerful R tools for solving data problems with greater clarity and ease Explore—examine your data, generate hypotheses, and quickly test them Model—provide a low-dimensional summary that captures true "signals" in your dataset Communicate—learn R Markdown for integrating prose, code, and results

Book Python Machine Learning for Beginners

Download or read book Python Machine Learning for Beginners written by Leonard Deep and published by . This book was released on 2019-05-13 with total page 236 pages. Available in PDF, EPUB and Kindle. Book excerpt: Are you interested to get into the programming world? Do you want to learn and understand Python and Machine Learning? Python Machine Learning for Beginners is the guide for you. Python Machine Learning for Beginners is the ultimate guide for beginners looking to learn and understand how Python programming works. Python Machine Learning for Beginners is split up into easy to learn chapters that will help guide the readers through the early stages of Python programming. It's this thought out and systematic approach to learning which makes Python Machine Learning for Beginners such a sought-after resource for those that want to learn about Python programming and about Machine Learning using an object-oriented programming approach. Inside Python Machine Learning for Beginners you will discover: An introduction to Machine Learning The main concepts of Machine Learning The basics of Python for beginners Machine Learning with Python Data Processing, Analysis, and Visualizations Case studies and much more! Throughout the book, you will learn the basic concepts behind Python programming which is designed to introduce you to Python programming. You will learn about getting started, the keywords and statements, data types and type conversion. Along with different examples, there are also exercises to help ensure that the information sinks in. You will find this book an invaluable tool for starting and mastering Machine Learning using Python. Once you complete Python Machine Learning for Beginners, you will be more than prepared to take on any Python programming. Scroll back up to the top of this page and hit BUY IT NOW to get your copy of Python Machine Learning for Beginners! You won't regret it!

Book Doing Data Science

    Book Details:
  • Author : Cathy O'Neil
  • Publisher : "O'Reilly Media, Inc."
  • Release : 2013-10-09
  • ISBN : 144936389X
  • Pages : 408 pages

Download or read book Doing Data Science written by Cathy O'Neil and published by "O'Reilly Media, Inc.". This book was released on 2013-10-09 with total page 408 pages. Available in PDF, EPUB and Kindle. Book excerpt: Now that people are aware that data can make the difference in an election or a business model, data science as an occupation is gaining ground. But how can you get started working in a wide-ranging, interdisciplinary field that’s so clouded in hype? This insightful book, based on Columbia University’s Introduction to Data Science class, tells you what you need to know. In many of these chapter-long lectures, data scientists from companies such as Google, Microsoft, and eBay share new algorithms, methods, and models by presenting case studies and the code they use. If you’re familiar with linear algebra, probability, and statistics, and have programming experience, this book is an ideal introduction to data science. Topics include: Statistical inference, exploratory data analysis, and the data science process Algorithms Spam filters, Naive Bayes, and data wrangling Logistic regression Financial modeling Recommendation engines and causality Data visualization Social networks and data journalism Data engineering, MapReduce, Pregel, and Hadoop Doing Data Science is collaboration between course instructor Rachel Schutt, Senior VP of Data Science at News Corp, and data science consultant Cathy O’Neil, a senior data scientist at Johnson Research Labs, who attended and blogged about the course.

Book A Data Scientist s Guide to Acquiring  Cleaning  and Managing Data in R

Download or read book A Data Scientist s Guide to Acquiring Cleaning and Managing Data in R written by Samuel E. Buttrey and published by John Wiley & Sons. This book was released on 2017-12-18 with total page 310 pages. Available in PDF, EPUB and Kindle. Book excerpt: The only how-to guide offering a unified, systemic approach to acquiring, cleaning, and managing data in R Every experienced practitioner knows that preparing data for modeling is a painstaking, time-consuming process. Adding to the difficulty is that most modelers learn the steps involved in cleaning and managing data piecemeal, often on the fly, or they develop their own ad hoc methods. This book helps simplify their task by providing a unified, systematic approach to acquiring, modeling, manipulating, cleaning, and maintaining data in R. Starting with the very basics, data scientists Samuel E. Buttrey and Lyn R. Whitaker walk readers through the entire process. From what data looks like and what it should look like, they progress through all the steps involved in getting data ready for modeling. They describe best practices for acquiring data from numerous sources; explore key issues in data handling, including text/regular expressions, big data, parallel processing, merging, matching, and checking for duplicates; and outline highly efficient and reliable techniques for documenting data and recordkeeping, including audit trails, getting data back out of R, and more. The only single-source guide to R data and its preparation, it describes best practices for acquiring, manipulating, cleaning, and maintaining data Begins with the basics and walks readers through all the steps necessary to get data ready for the modeling process Provides expert guidance on how to document the processes described so that they are reproducible Written by seasoned professionals, it provides both introductory and advanced techniques Features case studies with supporting data and R code, hosted on a companion website A Data Scientist's Guide to Acquiring, Cleaning and Managing Data in R is a valuable working resource/bench manual for practitioners who collect and analyze data, lab scientists and research associates of all levels of experience, and graduate-level data mining students.

Book Data Science For Dummies

Download or read book Data Science For Dummies written by Lillian Pierson and published by John Wiley & Sons. This book was released on 2021-08-20 with total page 436 pages. Available in PDF, EPUB and Kindle. Book excerpt: Monetize your company’s data and data science expertise without spending a fortune on hiring independent strategy consultants to help What if there was one simple, clear process for ensuring that all your company’s data science projects achieve a high a return on investment? What if you could validate your ideas for future data science projects, and select the one idea that’s most prime for achieving profitability while also moving your company closer to its business vision? There is. Industry-acclaimed data science consultant, Lillian Pierson, shares her proprietary STAR Framework – A simple, proven process for leading profit-forming data science projects. Not sure what data science is yet? Don’t worry! Parts 1 and 2 of Data Science For Dummies will get all the bases covered for you. And if you’re already a data science expert? Then you really won’t want to miss the data science strategy and data monetization gems that are shared in Part 3 onward throughout this book. Data Science For Dummies demonstrates: The only process you’ll ever need to lead profitable data science projects Secret, reverse-engineered data monetization tactics that no one’s talking about The shocking truth about how simple natural language processing can be How to beat the crowd of data professionals by cultivating your own unique blend of data science expertise Whether you’re new to the data science field or already a decade in, you’re sure to learn something new and incredibly valuable from Data Science For Dummies. Discover how to generate massive business wins from your company’s data by picking up your copy today.

Book Data Science for Business Professionals

Download or read book Data Science for Business Professionals written by Probyto Data Science and Consulting Pvt. Ltd. and published by BPB Publications. This book was released on 2020-05-06 with total page 368 pages. Available in PDF, EPUB and Kindle. Book excerpt: Primer into the multidisciplinary world of Data Science KEY FEATURESÊÊ - Explore and use the key concepts of Statistics required to solve data science problems - Use Docker, Jenkins, and Git for Continuous Development and Continuous Integration of your web app - Learn how to build Data Science solutions with GCP and AWS DESCRIPTIONÊ The book will initially explain the What-Why of Data Science and the process of solving a Data Science problem. The fundamental concepts of Data Science, such as Statistics, Machine Learning, Business Intelligence, Data pipeline, and Cloud Computing, will also be discussed. All the topics will be explained with an example problem and will show how the industry approaches to solve such a problem. The book will pose questions to the learners to solve the problems and build the problem-solving aptitude and effectively learn. The book uses Mathematics wherever necessary and will show you how it is implemented using Python with the help of an example dataset.Ê WHAT WILL YOU LEARNÊÊ - Understand the multi-disciplinary nature of Data Science - Get familiar with the key concepts in Mathematics and Statistics - Explore a few key ML algorithms and their use cases - Learn how to implement the basics of Data Pipelines - Get an overview of Cloud Computing & DevOps - Learn how to create visualizations using Tableau WHO THIS BOOK IS FORÊ This book is ideal for Data Science enthusiasts who want to explore various aspects of Data Science. Useful for Academicians, Business owners, and Researchers for a quick reference on industrial practices in Data Science.Ê TABLE OF CONTENTS 1. Data Science in Practice 2. Mathematics Essentials 3. Statistics Essentials 4. Exploratory Data Analysis 5. Data preprocessing 6. Feature Engineering 7. Machine learning algorithms 8. Productionizing ML models 9. Data Flows in Enterprises 10. Introduction to Databases 11. Introduction to Big Data 12. DevOps for Data Science 13. Introduction to Cloud Computing 14. Deploy Model to Cloud 15. Introduction to Business IntelligenceÊ 16. Data Visualization Tools 17. Industry Use Case 1 Ð FormAssist 18. Industry Use Case 2 Ð PeopleReporter 19. Data Science Learning Resources 20. Do It Your Self Challenges 21. MCQs for Assessments

Book A Beginner s Guide To DATA SCIENCE

Download or read book A Beginner s Guide To DATA SCIENCE written by Enamul Haque and published by . This book was released on 2023-01-06 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book is designed for aspiring data scientists who want to start their careers in data science, even if they don't have coding skills. It provides a comprehensive introduction to the foundations of data science and its applications, using simple language that is easy for beginners to understand. No technical expertise is required to master the material in this book. It is an ideal resource for anyone looking to learn about data science in an accessible and straightforward way. Key features include: Introduction to data science History of data science Data science life-cycle Data science tools and technologies Data science methodology Data science models Developing data science business strategy Managing data science projects Becoming a data scientist, data engineer etc. Big data Data Mining Artificial intelligence Machine learning Deep learning Neural networks Mathematical analysis Statistical modelling Understanding the fundamentals of data science programming languages Database structures and principles Robotic Process Automation Data science acronyms You need to know And a lot more.

Book A Beginners Guide To DATA SCIENCE

Download or read book A Beginners Guide To DATA SCIENCE written by Enamul Haque and published by . This book was released on 2021-03-31 with total page 288 pages. Available in PDF, EPUB and Kindle. Book excerpt: Calling all the Aspiring Data Scientists! This book is your "one-stop-shop" to kick start your data science career without knowing how to code! In fact, data science doesn't have to be complicated! With this book, you will grow an understanding of the foundations of data science and its applications. To master this book, you don't need technical abilities. This book is recommended for beginners and anybody who want to understand data science conveniently. You don't need a big textbook to master data science today. A straightforward language has been used to ensure ease of understanding, especially for beginners. Key features include: Introduction to data scienceHistory of data scienceData science life-cycleData science tools and technologiesData science methodologyData science modelsDeveloping data science business strategyManaging data science projectsBecoming a data scientist, data engineers etc.Doing data science without codingBig dataData MiningArtificial intelligenceMachine learningDeep learningNeural networksMathematical analysisStatistical modellingUnderstanding the fundamentals of Python and RDatabase structures and principlesRobotic Process AutomationData science acronyms you need to knowOnline free data science learning resources And a lot more

Book Data Science for Beginners

Download or read book Data Science for Beginners written by Alex Campbell and published by . This book was released on 2021-01-12 with total page 86 pages. Available in PDF, EPUB and Kindle. Book excerpt: Do you wonder what the fascination is around data these days? How do we obtain insights from this data? Do you know what a data scientist does? What is artificial intelligence and machine learning? Are these the same as data science? What does it take to become a data scientist? If you have ever wondered about these questions, you have come to the right place!There are many resources and courses online that you can use to learn more about data science, but with so much information available, it can become overwhelming. One of the best ways to learn about data science is to understand different machine learning concepts, statistics, and artificial intelligence to help you design models to perform an analysis.This book has all the information you need to learn what data science is, and what the prerequisites are to become a data scientist. If you're a beginner or if you already have experience in data science, this book will have something for you.In this book, you will: Learn what data science is about.Discover the difference between data science and business intelligence.Explore the tools required for data science.Find out the technical and non-technical skills every data scientist must have.Figure out how to create a visualization of the data set with clear and easy examples.Get advice on developing a Predictive Model Using R.Uncover detailed applications of data science.And much more!The book has been structured with easy-to-understand sections to help you learn everything you need to know about data science. In this book you will learn about the prerequisites of data science and the skills you need to become a data scientist. So, what are you waiting for? Grab your copy of this comprehensive guide now

Book The Beginner s Guide to Data Science

Download or read book The Beginner s Guide to Data Science written by Robert Ball and published by Springer Nature. This book was released on 2022-11-15 with total page 251 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book discusses the principles and practical applications of data science, addressing key topics including data wrangling, statistics, machine learning, data visualization, natural language processing and time series analysis. Detailed investigations of techniques used in the implementation of recommendation engines and the proper selection of metrics for distance-based analysis are also covered. Utilizing numerous comprehensive code examples, figures, and tables to help clarify and illuminate essential data science topics, the authors provide an extensive treatment and analysis of real-world questions, focusing especially on the task of determining and assessing answers to these questions as expeditiously and precisely as possible. This book addresses the challenges related to uncovering the actionable insights in “big data,” leveraging database and data collection tools such as web scraping and text identification. This book is organized as 11 chapters, structured as independent treatments of the following crucial data science topics: Data gathering and acquisition techniques including data creation Managing, transforming, and organizing data to ultimately package the information into an accessible format ready for analysis Fundamentals of descriptive statistics intended to summarize and aggregate data into a few concise but meaningful measurements Inferential statistics that allow us to infer (or generalize) trends about the larger population based only on the sample portion collected and recorded Metrics that measure some quantity such as distance, similarity, or error and which are especially useful when comparing one or more data observations Recommendation engines representing a set of algorithms designed to predict (or recommend) a particular product, service, or other item of interest a user or customer wishes to buy or utilize in some manner Machine learning implementations and associated algorithms, comprising core data science technologies with many practical applications, especially predictive analytics Natural Language Processing, which expedites the parsing and comprehension of written and spoken language in an effective and accurate manner Time series analysis, techniques to examine and generate forecasts about the progress and evolution of data over time Data science provides the methodology and tools to accurately interpret an increasing volume of incoming information in order to discern patterns, evaluate trends, and make the right decisions. The results of data science analysis provide real world answers to real world questions. Professionals working on data science and business intelligence projects as well as advanced-level students and researchers focused on data science, computer science, business and mathematics programs will benefit from this book.

Book Practitioner   s Guide to Data Science

Download or read book Practitioner s Guide to Data Science written by Nasir Ali Mirza and published by BPB Publications. This book was released on 2022-01-17 with total page 273 pages. Available in PDF, EPUB and Kindle. Book excerpt: Covers Data Science concepts, processes, and the real-world hands-on use cases. KEY FEATURES ● Covers the journey from a basic programmer to an effective Data Science developer. ● Applied use of Data Science native processes like CRISP-DM and Microsoft TDSP. ● Implementation of MLOps using Microsoft Azure DevOps. DESCRIPTION "How is the Data Science project to be implemented?" has never been more conceptually sounding, thanks to the work presented in this book. This book provides an in-depth look at the current state of the world's data and how Data Science plays a pivotal role in everything we do. This book explains and implements the entire Data Science lifecycle using well-known data science processes like CRISP-DM and Microsoft TDSP. The book explains the significance of these processes in connection with the high failure rate of Data Science projects. The book helps build a solid foundation in Data Science concepts and related frameworks. It teaches how to implement real-world use cases using data from the HMDA dataset. It explains Azure ML Service architecture, its capabilities, and implementation to the DS team, who will then be prepared to implement MLOps. The book also explains how to use Azure DevOps to make the process repeatable while we're at it. By the end of this book, you will learn strong Python coding skills, gain a firm grasp of concepts such as feature engineering, create insightful visualizations and become acquainted with techniques for building machine learning models. WHAT YOU WILL LEARN ● Organize Data Science projects using CRISP-DM and Microsoft TDSP. ● Learn to acquire and explore data using Python visualizations. ● Get well versed with the implementation of data pre-processing and Feature Engineering. ● Understand algorithm selection, model development, and model evaluation. ● Hands-on with Azure ML Service, its architecture, and capabilities. ● Learn to use Azure ML SDK and MLOps for implementing real-world use cases. WHO THIS BOOK IS FOR This book is intended for programmers who wish to pursue AI/ML development and build a solid conceptual foundation and familiarity with related processes and frameworks. Additionally, this book is an excellent resource for Software Architects and Managers involved in the design and delivery of Data Science-based solutions. TABLE OF CONTENTS 1. Data Science for Business 2. Data Science Project Methodologies and Team Processes 3. Business Understanding and Its Data Landscape 4. Acquire, Explore, and Analyze Data 5. Pre-processing and Preparing Data 6. Developing a Machine Learning Model 7. Lap Around Azure ML Service 8. Deploying and Managing Models

Book Data Analytics for Beginners

    Book Details:
  • Author : Robert J. Woz
  • Publisher : Createspace Independent Publishing Platform
  • Release : 2017-10
  • ISBN : 9781977843135
  • Pages : 112 pages

Download or read book Data Analytics for Beginners written by Robert J. Woz and published by Createspace Independent Publishing Platform. This book was released on 2017-10 with total page 112 pages. Available in PDF, EPUB and Kindle. Book excerpt: If you are convinced that the world today is producing more data than the previous decades, then you understand that processing yesterday's data for today's use at times is not enough. The level of data analysis that is needed in highly competitive business environment needs to be processed, analyzed and used immediately for businesses to be ahead of their competition. Having this in mind, you need to understand from the ground up, what data is, the different types of data and how you should identify the right data for your business. To help you understand the simple basics of data and how it needs to be analyzed, then Data Analytics for Beginners is the book that you have been waiting for. The size and type of business you are running doesn't matter because after all, it will depend on your ability to understand the data that your business is exposed to so as to make better business decisions for the current working environment and the future. Are there patterns in your business that you cannot see? Do you want to make sense of the shopping trends of your clients to better enrich their experience? Do you want to know your target market even more? Do you want to better derive insights from the feedback your clients give you? These questions can only be answered when you perform a data analysis for your business. Collecting the data is one thing, analyzing them is another matter entirely as it is not something that can be done haphazardly by just looking at the data. If you hope to understand your data well, you need to understand the data you are collecting, the methods to use and the right tools to use when analyzing the data. Inside you will find valuable steps and tools that will help make your information work for you. Do not let yourself get complacent, stop looking at the data that you collect each day and start analyzing your data to move your business up. Get started by buying this book today! Inside you will find How data should be understood? Terms and concepts used in data analysis. Data mining and the different kinds of databases used to store data. How information can be retrieved and manipulated in the database to create a visual representation of what you want to know? The life cycle of data analysis. And more...

Book Data Science from Scratch

Download or read book Data Science from Scratch written by Joel Grus and published by "O'Reilly Media, Inc.". This book was released on 2015-04-14 with total page 330 pages. Available in PDF, EPUB and Kindle. Book excerpt: Data science libraries, frameworks, modules, and toolkits are great for doing data science, but they’re also a good way to dive into the discipline without actually understanding data science. In this book, you’ll learn how many of the most fundamental data science tools and algorithms work by implementing them from scratch. If you have an aptitude for mathematics and some programming skills, author Joel Grus will help you get comfortable with the math and statistics at the core of data science, and with hacking skills you need to get started as a data scientist. Today’s messy glut of data holds answers to questions no one’s even thought to ask. This book provides you with the know-how to dig those answers out. Get a crash course in Python Learn the basics of linear algebra, statistics, and probability—and understand how and when they're used in data science Collect, explore, clean, munge, and manipulate data Dive into the fundamentals of machine learning Implement models such as k-nearest Neighbors, Naive Bayes, linear and logistic regression, decision trees, neural networks, and clustering Explore recommender systems, natural language processing, network analysis, MapReduce, and databases