EBookClubs

Read Books & Download eBooks Full Online

EBookClubs

Read Books & Download eBooks Full Online

Book Data Storytelling with Altair and AI

Download or read book Data Storytelling with Altair and AI written by Angelica Lo Duca and published by Simon and Schuster. This book was released on 2024-09-24 with total page 568 pages. Available in PDF, EPUB and Kindle. Book excerpt: Great data presentations tell a story. Learn how to organize, visualize, and present data using Python, generative AI, and the cutting-edge Altair data visualization toolkit. Take the fast track to amazing data presentations! Data Storytelling with Altair and AI introduces a stack of useful tools and tried-and-tested methodologies that will rapidly increase your productivity, streamline the visualization process, and leave your audience inspired. In Data Storytelling with Altair and AI you’ll discover: • Using Python Altair for data visualization • Using Generative AI tools for data storytelling • The main concepts of data storytelling • Building data stories with the DIKW pyramid approach • Transforming raw data into a data story Data Storytelling with Altair and AI teaches you how to turn raw data into effective, insightful data stories. You’ll learn exactly what goes into an effective data story, then combine your Python data skills with the Altair library and AI tools to rapidly create amazing visualizations. Your bosses and decision-makers will love your new presentations—and you’ll love how quick Generative AI makes the whole process! About the technology Every dataset tells a story. After you’ve cleaned, crunched, and organized the raw data, it’s your job to share its story in a way that connects with your audience. Python’s Altair data visualization library, combined with generative AI tools like Copilot and ChatGPT, provide an amazing toolbox for transforming numbers, code, text, and graphics into intuitive data presentations. About the book Data Storytelling with Altair and AI teaches you how to build enhanced data visualizations using these tools. The book uses hands-on examples to build powerful narratives that can inform, inspire, and motivate. It covers the Altair data visualization library, along with AI techniques like generating text with ChatGPT, creating images with DALL-E, and Python coding with Copilot. You’ll learn by practicing with each interesting data story, from tourist arrivals in Portugal to population growth in the USA to fake news, salmon aquaculture, and more. What's inside • The Data-Information-Knowledge-Wisdom (DIKW) pyramid • Publish data stories using Streamlit, Tableau, and Comet • Vega and Vega-Lite visualization grammar About the reader For data analysts and data scientists experienced with Python. No previous knowledge of Altair or Generative AI required. About the author Angelica Lo Duca is a researcher at the Institute of Informatics and Telematics of the National Research Council, Italy. The technical editor on this book was Ninoslav Cerkez. Table of Contents PART 1 1 Introducing data storytelling 2 Running your first data story in Altair and GitHub Copilot 3 Reviewing the basic concepts of Altair 4 Generative AI tools for data storytelling PART 2 5 Crafting a data story using the DIKW pyramid 6 From data to information: Extracting insights 7 From information to knowledge: Building textual context 8 From information to knowledge: Building the visual context 9 From knowledge to wisdom: Adding next steps PART 3 10 Common issues while using generative AI 11 Publishing the data story A Technical requirements B Python pandas DataFrameC Other chart types

Book Storytelling with Data

    Book Details:
  • Author : Cole Nussbaumer Knaflic
  • Publisher : John Wiley & Sons
  • Release : 2015-10-09
  • ISBN : 1119002265
  • Pages : 284 pages

Download or read book Storytelling with Data written by Cole Nussbaumer Knaflic and published by John Wiley & Sons. This book was released on 2015-10-09 with total page 284 pages. Available in PDF, EPUB and Kindle. Book excerpt: Don't simply show your data—tell a story with it! Storytelling with Data teaches you the fundamentals of data visualization and how to communicate effectively with data. You'll discover the power of storytelling and the way to make data a pivotal point in your story. The lessons in this illuminative text are grounded in theory, but made accessible through numerous real-world examples—ready for immediate application to your next graph or presentation. Storytelling is not an inherent skill, especially when it comes to data visualization, and the tools at our disposal don't make it any easier. This book demonstrates how to go beyond conventional tools to reach the root of your data, and how to use your data to create an engaging, informative, compelling story. Specifically, you'll learn how to: Understand the importance of context and audience Determine the appropriate type of graph for your situation Recognize and eliminate the clutter clouding your information Direct your audience's attention to the most important parts of your data Think like a designer and utilize concepts of design in data visualization Leverage the power of storytelling to help your message resonate with your audience Together, the lessons in this book will help you turn your data into high impact visual stories that stick with your audience. Rid your world of ineffective graphs, one exploding 3D pie chart at a time. There is a story in your data—Storytelling with Data will give you the skills and power to tell it!

Book Python Data Visualization Essentials Guide

Download or read book Python Data Visualization Essentials Guide written by Kallur Rahman and published by BPB Publications. This book was released on 2021-07-30 with total page 319 pages. Available in PDF, EPUB and Kindle. Book excerpt: Build your data science skills. Start data visualization Using Python. Right away. Become a good data analyst by creating quality data visualizations using Python. KEY FEATURES ● Exciting coverage on loads of Python libraries, including Matplotlib, Seaborn, Pandas, and Plotly. ● Tons of examples, illustrations, and use-cases to demonstrate visual storytelling of varied datasets. ● Covers a strong fundamental understanding of exploratory data analysis (EDA), statistical modeling, and data mining. DESCRIPTION Data visualization plays a major role in solving data science challenges with various capabilities it offers. This book aims to equip you with a sound knowledge of Python in conjunction with the concepts you need to master to succeed as a data visualization expert. The book starts with a brief introduction to the world of data visualization and talks about why it is important, the history of visualization, and the capabilities it offers. You will learn how to do simple Python-based visualization with examples with progressive complexity of key features. The book starts with Matplotlib and explores the power of data visualization with over 50 examples. It then explores the power of data visualization using one of the popular exploratory data analysis-oriented libraries, Pandas. The book talks about statistically inclined data visualization libraries such as Seaborn. The book also teaches how we can leverage bokeh and Plotly for interactive data visualization. Each chapter is enriched and loaded with 30+ examples that will guide you in learning everything about data visualization and storytelling of mixed datasets. WHAT YOU WILL LEARN ● Learn to work with popular Python libraries and frameworks, including Seaborn, Bokeh, and Plotly. ● Practice your data visualization understanding across numerous datasets and real examples. ● Learn to visualize geospatial and time-series datasets. ● Perform correlation and EDA analysis using Pandas and Matplotlib. ● Get to know storytelling of complex and unstructured data using Bokeh and Pandas. ● Learn best practices in writing clean and short python scripts for a quicker visual summary of datasets. WHO THIS BOOK IS FOR This book is for all data analytics professionals, data scientists, and data mining hobbyists who want to be strong data visualizers by learning all the popular Python data visualization libraries. Prior working knowledge of Python is assumed. TABLE OF CONTENTS 1. Introduction to Data Visualization 2. Why Data Visualization 3. Various Data Visualization Elements and Tools 4. Using Matplotlib with Python 5. Using NumPy and Pandas for Plotting 6. Using Seaborn for Visualization 7. Using Bokeh with Python 8. Using Plotly, Folium, and Other Tools for Data Visualization 9. Hands-on Examples and Exercises, Case Studies, and Further Resources

Book Storytelling with Data

    Book Details:
  • Author : Cole Nussbaumer Knaflic
  • Publisher : John Wiley & Sons
  • Release : 2019-10-22
  • ISBN : 1119621496
  • Pages : 454 pages

Download or read book Storytelling with Data written by Cole Nussbaumer Knaflic and published by John Wiley & Sons. This book was released on 2019-10-22 with total page 454 pages. Available in PDF, EPUB and Kindle. Book excerpt: Influence action through data! This is not a book. It is a one-of-a-kind immersive learning experience through which you can become—or teach others to be—a powerful data storyteller. Let’s practice! helps you build confidence and credibility to create graphs and visualizations that make sense and weave them into action-inspiring stories. Expanding upon best seller storytelling with data’s foundational lessons, Let’s practice! delivers fresh content, a plethora of new examples, and over 100 hands-on exercises. Author and data storytelling maven Cole Nussbaumer Knaflic guides you along the path to hone core skills and become a well-practiced data communicator. Each chapter includes: ● Practice with Cole: exercises based on real-world examples first posed for you to consider and solve, followed by detailed step-by-step illustration and explanation ● Practice on your own: thought-provoking questions and even more exercises to be assigned or worked through individually, without prescribed solutions ● Practice at work: practical guidance and hands-on exercises for applying storytelling with data lessons on the job, including instruction on when and how to solicit useful feedback and refine for greater impact The lessons and exercises found within this comprehensive guide will empower you to master—or develop in others—data storytelling skills and transition your work from acceptable to exceptional. By investing in these skills for ourselves and our teams, we can all tell inspiring and influential data stories!

Book Introduction to Data Visualization and Storytelling

Download or read book Introduction to Data Visualization and Storytelling written by Jose Berengueres and published by Independently Published. This book was released on 2019-07-28 with total page 99 pages. Available in PDF, EPUB and Kindle. Book excerpt: An introduction to data visualization and data storytelling. This book explains (visually) the fundamental principles of a meaningful chart making at high level. No coding or statistics skills required. Audience: data visualization students, senior data scientists, prescriptive analytics consultants. Written by a design thinking professor and multiple-times awarded kaggle master, this book hits the sweet spot between abstraction and detail.

Book What the Dormouse Said

    Book Details:
  • Author : John Markoff
  • Publisher : Penguin
  • Release : 2005-04-21
  • ISBN : 1101201088
  • Pages : 462 pages

Download or read book What the Dormouse Said written by John Markoff and published by Penguin. This book was released on 2005-04-21 with total page 462 pages. Available in PDF, EPUB and Kindle. Book excerpt: “This makes entertaining reading. Many accounts of the birth of personal computing have been written, but this is the first close look at the drug habits of the earliest pioneers.” —New York Times Most histories of the personal computer industry focus on technology or business. John Markoff’s landmark book is about the culture and consciousness behind the first PCs—the culture being counter– and the consciousness expanded, sometimes chemically. It’s a brilliant evocation of Stanford, California, in the 1960s and ’70s, where a group of visionaries set out to turn computers into a means for freeing minds and information. In these pages one encounters Ken Kesey and the phone hacker Cap’n Crunch, est and LSD, The Whole Earth Catalog and the Homebrew Computer Lab. What the Dormouse Said is a poignant, funny, and inspiring book by one of the smartest technology writers around.

Book Caged

    Book Details:
  • Author : Ellison Cooper
  • Publisher : Macmillan + ORM
  • Release : 2018-07-10
  • ISBN : 125017385X
  • Pages : 320 pages

Download or read book Caged written by Ellison Cooper and published by Macmillan + ORM. This book was released on 2018-07-10 with total page 320 pages. Available in PDF, EPUB and Kindle. Book excerpt: An FBI analyst hunts for a sadistic serial killer in Washington, DC, in this “dark and mesmerizing” thriller—“equal parts Kathy Reichs and Thomas Harris” (Lisa Gardner). FBI neuroscientist Sayer Altair hunts for evil in the deepest recesses of the human mind. Still reeling from the death of her fiancé, she wants nothing more than to focus on her research into the brains of serial killers. But when the Washington, DC, police stumble upon a gruesome murder involving a girl who was starved to death while held in a cage, Sayer is called in to lead the investigation. Then the victim is identified as the daughter of a high profile senator—and Sayer is thrust into the spotlight. As public pressure mounts, she discovers that another girl has been taken and is teetering on the brink of death. With evidence unraveling around her, Sayer realizes that they are hunting a killer with a dangerous obsession . . . a killer who is closer than she thought.

Book Data Visualization Made Simple

Download or read book Data Visualization Made Simple written by Kristen Sosulski and published by Routledge. This book was released on 2018-09-27 with total page 285 pages. Available in PDF, EPUB and Kindle. Book excerpt: Data Visualization Made Simple is a practical guide to the fundamentals, strategies, and real-world cases for data visualization, an essential skill required in today’s information-rich world. With foundations rooted in statistics, psychology, and computer science, data visualization offers practitioners in almost every field a coherent way to share findings from original research, big data, learning analytics, and more. In nine appealing chapters, the book: examines the role of data graphics in decision-making, sharing information, sparking discussions, and inspiring future research; scrutinizes data graphics, deliberates on the messages they convey, and looks at options for design visualization; and includes cases and interviews to provide a contemporary view of how data graphics are used by professionals across industries Both novices and seasoned designers in education, business, and other areas can use this book’s effective, linear process to develop data visualization literacy and promote exploratory, inquiry-based approaches to visualization problems.

Book Lost At Sea

    Book Details:
  • Author : Patrick Dillon
  • Publisher : Simon and Schuster
  • Release : 2000-08-02
  • ISBN : 0684869098
  • Pages : 294 pages

Download or read book Lost At Sea written by Patrick Dillon and published by Simon and Schuster. This book was released on 2000-08-02 with total page 294 pages. Available in PDF, EPUB and Kindle. Book excerpt: Recounts the story of the fishing boats Americus and Altair that capsized in the icy waters of the Bering Sea in 1983 and killed all on board. Includes reading guide.

Book Interactive Data Visualization with Python

Download or read book Interactive Data Visualization with Python written by Abha Belorkar and published by Packt Publishing Ltd. This book was released on 2020-04-14 with total page 362 pages. Available in PDF, EPUB and Kindle. Book excerpt: Create your own clear and impactful interactive data visualizations with the powerful data visualization libraries of Python Key FeaturesStudy and use Python interactive libraries, such as Bokeh and PlotlyExplore different visualization principles and understand when to use which oneCreate interactive data visualizations with real-world dataBook Description With so much data being continuously generated, developers, who can present data as impactful and interesting visualizations, are always in demand. Interactive Data Visualization with Python sharpens your data exploration skills, tells you everything there is to know about interactive data visualization in Python. You'll begin by learning how to draw various plots with Matplotlib and Seaborn, the non-interactive data visualization libraries. You'll study different types of visualizations, compare them, and find out how to select a particular type of visualization to suit your requirements. After you get a hang of the various non-interactive visualization libraries, you'll learn the principles of intuitive and persuasive data visualization, and use Bokeh and Plotly to transform your visuals into strong stories. You'll also gain insight into how interactive data and model visualization can optimize the performance of a regression model. By the end of the course, you'll have a new skill set that'll make you the go-to person for transforming data visualizations into engaging and interesting stories. What you will learnExplore and apply different interactive data visualization techniquesManipulate plotting parameters and styles to create appealing plotsCustomize data visualization for different audiencesDesign data visualizations using interactive librariesUse Matplotlib, Seaborn, Altair and Bokeh for drawing appealing plotsCustomize data visualization for different scenariosWho this book is for This book intends to provide a solid training ground for Python developers, data analysts and data scientists to enable them to present critical data insights in a way that best captures the user's attention and imagination. It serves as a simple step-by-step guide that demonstrates the different types and components of visualization, the principles, and techniques of effective interactivity, as well as common pitfalls to avoid when creating interactive data visualizations. Students should have an intermediate level of competency in writing Python code, as well as some familiarity with using libraries such as pandas.

Book How to Lead in Data Science

Download or read book How to Lead in Data Science written by Jike Chong and published by Simon and Schuster. This book was released on 2021-12-28 with total page 823 pages. Available in PDF, EPUB and Kindle. Book excerpt: A field guide for the unique challenges of data science leadership, filled with transformative insights, personal experiences, and industry examples. In How To Lead in Data Science you will learn: Best practices for leading projects while balancing complex trade-offs Specifying, prioritizing, and planning projects from vague requirements Navigating structural challenges in your organization Working through project failures with positivity and tenacity Growing your team with coaching, mentoring, and advising Crafting technology roadmaps and championing successful projects Driving diversity, inclusion, and belonging within teams Architecting a long-term business strategy and data roadmap as an executive Delivering a data-driven culture and structuring productive data science organizations How to Lead in Data Science is full of techniques for leading data science at every seniority level—from heading up a single project to overseeing a whole company's data strategy. Authors Jike Chong and Yue Cathy Chang share hard-won advice that they've developed building data teams for LinkedIn, Acorns, Yiren Digital, large asset-management firms, Fortune 50 companies, and more. You'll find advice on plotting your long-term career advancement, as well as quick wins you can put into practice right away. Carefully crafted assessments and interview scenarios encourage introspection, reveal personal blind spots, and highlight development areas. About the technology Lead your data science teams and projects to success! To make a consistent, meaningful impact as a data science leader, you must articulate technology roadmaps, plan effective project strategies, support diversity, and create a positive environment for professional growth. This book delivers the wisdom and practical skills you need to thrive as a data science leader at all levels, from team member to the C-suite. About the book How to Lead in Data Science shares unique leadership techniques from high-performance data teams. It’s filled with best practices for balancing project trade-offs and producing exceptional results, even when beginning with vague requirements or unclear expectations. You’ll find a clearly presented modern leadership framework based on current case studies, with insights reaching all the way to Aristotle and Confucius. As you read, you’ll build practical skills to grow and improve your team, your company’s data culture, and yourself. What's inside How to coach and mentor team members Navigate an organization’s structural challenges Secure commitments from other teams and partners Stay current with the technology landscape Advance your career About the reader For data science practitioners at all levels. About the author Dr. Jike Chong and Yue Cathy Chang build, lead, and grow high-performing data teams across industries in public and private companies, such as Acorns, LinkedIn, large asset-management firms, and Fortune 50 companies. Table of Contents 1 What makes a successful data scientist? PART 1 THE TECH LEAD: CULTIVATING LEADERSHIP 2 Capabilities for leading projects 3 Virtues for leading projects PART 2 THE MANAGER: NURTURING A TEAM 4 Capabilities for leading people 5 Virtues for leading people PART 3 THE DIRECTOR: GOVERNING A FUNCTION 6 Capabilities for leading a function 7 Virtues for leading a function PART 4 THE EXECUTIVE: INSPIRING AN INDUSTRY 8 Capabilities for leading a company 9 Virtues for leading a company PART 5 THE LOOP AND THE FUTURE 10 Landscape, organization, opportunity, and practice 11 Leading in data science and a future outlook

Book Pandas for Everyone

    Book Details:
  • Author : Daniel Y. Chen
  • Publisher : Addison-Wesley Professional
  • Release : 2017-12-15
  • ISBN : 0134547055
  • Pages : 1093 pages

Download or read book Pandas for Everyone written by Daniel Y. Chen and published by Addison-Wesley Professional. This book was released on 2017-12-15 with total page 1093 pages. Available in PDF, EPUB and Kindle. Book excerpt: The Hands-On, Example-Rich Introduction to Pandas Data Analysis in Python Today, analysts must manage data characterized by extraordinary variety, velocity, and volume. Using the open source Pandas library, you can use Python to rapidly automate and perform virtually any data analysis task, no matter how large or complex. Pandas can help you ensure the veracity of your data, visualize it for effective decision-making, and reliably reproduce analyses across multiple datasets. Pandas for Everyone brings together practical knowledge and insight for solving real problems with Pandas, even if you’re new to Python data analysis. Daniel Y. Chen introduces key concepts through simple but practical examples, incrementally building on them to solve more difficult, real-world problems. Chen gives you a jumpstart on using Pandas with a realistic dataset and covers combining datasets, handling missing data, and structuring datasets for easier analysis and visualization. He demonstrates powerful data cleaning techniques, from basic string manipulation to applying functions simultaneously across dataframes. Once your data is ready, Chen guides you through fitting models for prediction, clustering, inference, and exploration. He provides tips on performance and scalability, and introduces you to the wider Python data analysis ecosystem. Work with DataFrames and Series, and import or export data Create plots with matplotlib, seaborn, and pandas Combine datasets and handle missing data Reshape, tidy, and clean datasets so they’re easier to work with Convert data types and manipulate text strings Apply functions to scale data manipulations Aggregate, transform, and filter large datasets with groupby Leverage Pandas’ advanced date and time capabilities Fit linear models using statsmodels and scikit-learn libraries Use generalized linear modeling to fit models with different response variables Compare multiple models to select the “best” Regularize to overcome overfitting and improve performance Use clustering in unsupervised machine learning

Book Harlequin Valentine

Download or read book Harlequin Valentine written by Neil Gaiman and published by Dark Horse Comics. This book was released on 2016-11-30 with total page 43 pages. Available in PDF, EPUB and Kindle. Book excerpt: In this modern hardcover retelling of a classic commedia dell'arte legend of tomfoolery and hopeless, fawning love, creators Neil Gaiman (The Sandman, the Newbery Medal-winning The Graveyard Book) and John Bolton (Evil Dead) update the relationship of Harlequin and Columbine. A buffoon burdened with a brimming heart, Harlequin chases his sensible, oblivious Columbine around the city streets, having given his heart freely. Consumed with love, the impulsive clown sees his heart dragged about town, with a charming surprise to bend the tale in a modern direction. Gaiman's writing is poetic and as heartfelt as the subject matter. Bolton's art, a combination of digitally enhanced photorealism and dynamic painting, provides sensational depth with bright characters over fittingly muted backgrounds. Those who have spent Valentine's Day alone are aware that the cold February holiday can be hard to swallow. Gaiman and Bolton want you to know that all it takes is a steak knife, a fork, and a bottle of quality ketchup!

Book Tiddler

    Book Details:
  • Author : Julia Donaldson
  • Publisher : Scholastic Canada
  • Release : 2017-01-31
  • ISBN : 1443148989
  • Pages : 42 pages

Download or read book Tiddler written by Julia Donaldson and published by Scholastic Canada. This book was released on 2017-01-31 with total page 42 pages. Available in PDF, EPUB and Kindle. Book excerpt: This colourful undersea adventure from the bestselling creators of The Gruffalo and Stick Man is now available in board book format! Tiddler is late to school every day, and he always has an elaborate excuse for his teacher. One day, as Tiddler is thinking up his next story, a net sweeps him up and hauls him far away from his school. How will Tiddler find his way home? All he has to do is follow the trail of his biggest, fishiest story yet! Full of bright colour and bouncy repetition, this engaging book makes a fun introduction to the wonders of story-telling!

Book The Ugly Duckling

    Book Details:
  • Author : Hans Christian Andersen
  • Publisher :
  • Release : 1927
  • ISBN :
  • Pages : 52 pages

Download or read book The Ugly Duckling written by Hans Christian Andersen and published by . This book was released on 1927 with total page 52 pages. Available in PDF, EPUB and Kindle. Book excerpt: An ugly duckling spends an unhappy year ostracized by the other animals before he grows into a beautiful swan.

Book Gold

    Book Details:
  • Author : Isaac Asimov
  • Publisher : Harper Collins
  • Release : 2009-03-17
  • ISBN : 0061802700
  • Pages : 418 pages

Download or read book Gold written by Isaac Asimov and published by Harper Collins. This book was released on 2009-03-17 with total page 418 pages. Available in PDF, EPUB and Kindle. Book excerpt: The final collection of fiction and essays by the most celebrated science fiction author of all time—including the Hugo Award–winning story “Gold.” Isaac Asimov is widely considered both the inventor of science fiction as well as the genre’s greatest practitioner. This wide-ranging collection is the final and crowning achievement of his fifty-year career as a writer. It includes an introduction by the renowned science fiction author Orson Scott Card. The first section contains stories that range from the humorous to the profound, at the heart of which is the title story, “Gold,” a moving and revealing drama about a writer who gambles everything on a chance at immortality: a gamble Asimov himself made—and won. The second section contains the grand master’s ruminations on the SF genre itself. And the final section is comprised of Asimov’s thoughts on the craft and writing of science fiction.

Book Data Science Bookcamp

    Book Details:
  • Author : Leonard Apeltsin
  • Publisher : Simon and Schuster
  • Release : 2021-12-07
  • ISBN : 1638352305
  • Pages : 702 pages

Download or read book Data Science Bookcamp written by Leonard Apeltsin and published by Simon and Schuster. This book was released on 2021-12-07 with total page 702 pages. Available in PDF, EPUB and Kindle. Book excerpt: Learn data science with Python by building five real-world projects! Experiment with card game predictions, tracking disease outbreaks, and more, as you build a flexible and intuitive understanding of data science. In Data Science Bookcamp you will learn: - Techniques for computing and plotting probabilities - Statistical analysis using Scipy - How to organize datasets with clustering algorithms - How to visualize complex multi-variable datasets - How to train a decision tree machine learning algorithm In Data Science Bookcamp you’ll test and build your knowledge of Python with the kind of open-ended problems that professional data scientists work on every day. Downloadable data sets and thoroughly-explained solutions help you lock in what you’ve learned, building your confidence and making you ready for an exciting new data science career. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the technology A data science project has a lot of moving parts, and it takes practice and skill to get all the code, algorithms, datasets, formats, and visualizations working together harmoniously. This unique book guides you through five realistic projects, including tracking disease outbreaks from news headlines, analyzing social networks, and finding relevant patterns in ad click data. About the book Data Science Bookcamp doesn’t stop with surface-level theory and toy examples. As you work through each project, you’ll learn how to troubleshoot common problems like missing data, messy data, and algorithms that don’t quite fit the model you’re building. You’ll appreciate the detailed setup instructions and the fully explained solutions that highlight common failure points. In the end, you’ll be confident in your skills because you can see the results. What's inside - Web scraping - Organize datasets with clustering algorithms - Visualize complex multi-variable datasets - Train a decision tree machine learning algorithm About the reader For readers who know the basics of Python. No prior data science or machine learning skills required. About the author Leonard Apeltsin is the Head of Data Science at Anomaly, where his team applies advanced analytics to uncover healthcare fraud, waste, and abuse. Table of Contents CASE STUDY 1 FINDING THE WINNING STRATEGY IN A CARD GAME 1 Computing probabilities using Python 2 Plotting probabilities using Matplotlib 3 Running random simulations in NumPy 4 Case study 1 solution CASE STUDY 2 ASSESSING ONLINE AD CLICKS FOR SIGNIFICANCE 5 Basic probability and statistical analysis using SciPy 6 Making predictions using the central limit theorem and SciPy 7 Statistical hypothesis testing 8 Analyzing tables using Pandas 9 Case study 2 solution CASE STUDY 3 TRACKING DISEASE OUTBREAKS USING NEWS HEADLINES 10 Clustering data into groups 11 Geographic location visualization and analysis 12 Case study 3 solution CASE STUDY 4 USING ONLINE JOB POSTINGS TO IMPROVE YOUR DATA SCIENCE RESUME 13 Measuring text similarities 14 Dimension reduction of matrix data 15 NLP analysis of large text datasets 16 Extracting text from web pages 17 Case study 4 solution CASE STUDY 5 PREDICTING FUTURE FRIENDSHIPS FROM SOCIAL NETWORK DATA 18 An introduction to graph theory and network analysis 19 Dynamic graph theory techniques for node ranking and social network analysis 20 Network-driven supervised machine learning 21 Training linear classifiers with logistic regression 22 Training nonlinear classifiers with decision tree techniques 23 Case study 5 solution