Download or read book Teradata Cookbook written by Abhinav Khandelwal and published by Packt Publishing Ltd. This book was released on 2018-02-15 with total page 448 pages. Available in PDF, EPUB and Kindle. Book excerpt: Data management and analytics simplified with Teradata Key Features Take your understanding of Teradata to the next level and build efficient data warehousing applications for your organization Covers recipes on data handling, warehousing, advanced querying and the administrative tasks in Teradata. Contains practical solutions to tackle common (and not-so-common) problems you might encounter in your day-to-day activities Book Description Teradata is an enterprise software company that develops and sells its eponymous relational database management system (RDBMS), which is considered to be a leading data warehousing solution and provides data management solutions for analytics. This book will help you get all the practical information you need for the creation and implementation of your data warehousing solution using Teradata. The book begins with recipes on quickly setting up a development environment so you can work with different types of data structuring and manipulation functions. You will tackle all problems related to efficient querying, stored procedure searching, and navigation techniques. Additionally, you’ll master various administrative tasks such as user and security management, workload management, high availability, performance tuning, and monitoring. This book is designed to take you through the best practices of performing the real daily tasks of a Teradata DBA, and will help you tackle any problem you might encounter in the process. What you will learn Understand Teradata's competitive advantage over other RDBMSs. Use SQL to process data stored in Teradata tables. Leverage Teradata’s available application utilities and parallelism to play with large datasets Apply various performance tuning techniques to optimize the queries. 
Acquire deeper knowledge and understanding of the Teradata Architecture. Easy steps to load, archive, restore data and implement Teradata protection features Gain confidence in running a wide variety of data analytics and develop applications for the Teradata environment Who this book is for This book is for database administrators and Teradata users who are looking for a practical, one-stop resource to solve all their problems while handling their Teradata solution. If you are looking to learn the basic as well as the advanced tasks involved in Teradata querying or administration, this book will be handy. Some knowledge of relational database concepts will be helpful to get the best out of this book.
Download or read book Snowflake Cookbook written by Hamid Mahmood Qureshi and published by Packt Publishing Ltd. This book was released on 2021-02-25 with total page 330 pages. Available in PDF, EPUB and Kindle. Book excerpt: Develop modern solutions with Snowflake's unique architecture and integration capabilities; process bulk and real-time data into a data lake; and leverage time travel, cloning, and data-sharing features to optimize data operations Key Features Build and scale modern data solutions using the all-in-one Snowflake platform Perform advanced cloud analytics for implementing big data and data science solutions Make quicker and better-informed business decisions by uncovering key insights from your data Book Description Snowflake is a unique cloud-based data warehousing platform built from scratch to perform data management on the cloud. This book introduces you to Snowflake's unique architecture, which places it at the forefront of cloud data warehouses. You'll explore the compute model available with Snowflake, and find out how Snowflake allows extensive scaling through the virtual warehouses. You will then learn how to configure a virtual warehouse for optimizing cost and performance. Moving on, you'll get to grips with the data ecosystem and discover how Snowflake integrates with other technologies for staging and loading data. As you progress through the chapters, you will leverage Snowflake's capabilities to process a series of SQL statements using tasks to build data pipelines and find out how you can create modern data solutions and pipelines designed to provide high performance and scalability. You will also get to grips with creating role hierarchies, adding custom roles, and setting default roles for users before covering advanced topics such as data sharing, cloning, and performance optimization. 
By the end of this Snowflake book, you will be well-versed in Snowflake's architecture for building modern analytical solutions and understand best practices for solving commonly faced problems using practical recipes. What you will learn Get to grips with data warehousing techniques aligned with Snowflake's cloud architecture Broaden your skills as a data warehouse designer to cover the Snowflake ecosystem Transfer skills from on-premise data warehousing to the Snowflake cloud analytics platform Optimize performance and costs associated with a Snowflake solution Stage data on object stores and load it into Snowflake Secure data and share it efficiently for access Manage transactions and extend Snowflake using stored procedures Extend cloud data applications using Spark Connector Who this book is for This book is for data warehouse developers, data analysts, database administrators, and anyone involved in designing, implementing, and optimizing a Snowflake data warehouse. Knowledge of data warehousing and database and cloud concepts will be useful. Basic familiarity with Snowflake is beneficial, but not necessary.
Download or read book Apache Sqoop Cookbook written by Kathleen Ting and published by "O'Reilly Media, Inc.". This book was released on 2013-07-02 with total page 125 pages. Available in PDF, EPUB and Kindle. Book excerpt: Integrating data from multiple sources is essential in the age of big data, but it can be a challenging and time-consuming task. This handy cookbook provides dozens of ready-to-use recipes for using Apache Sqoop, the command-line interface application that optimizes data transfers between relational databases and Hadoop. Sqoop is both powerful and bewildering, but with this cookbook’s problem-solution-discussion format, you’ll quickly learn how to deploy and then apply Sqoop in your environment. The authors provide MySQL, Oracle, and PostgreSQL database examples on GitHub that you can easily adapt for SQL Server, Netezza, Teradata, or other relational systems. Transfer data from a single database table into your Hadoop ecosystem Keep table data and Hadoop in sync by importing data incrementally Import data from more than one database table Customize transferred data by calling various database functions Export generated, processed, or backed-up data from Hadoop to your database Run Sqoop within Oozie, Hadoop’s specialized workflow scheduler Load data into Hadoop’s data warehouse (Hive) or database (HBase) Handle installation, connection, and syntax issues common to specific database vendors
Download or read book Azure Data Factory Cookbook written by Dmitry Anoshin and published by Packt Publishing Ltd. This book was released on 2020-12-24 with total page 383 pages. Available in PDF, EPUB and Kindle. Book excerpt: Solve real-world data problems and create data-driven workflows for easy data movement and processing at scale with Azure Data Factory Key Features Learn how to load and transform data from various sources, both on-premises and on cloud Use Azure Data Factory’s visual environment to build and manage hybrid ETL pipelines Discover how to prepare, transform, process, and enrich data to generate key insights Book Description Azure Data Factory (ADF) is a modern data integration tool available on Microsoft Azure. This Azure Data Factory Cookbook helps you get up and running by showing you how to create and execute your first job in ADF. You’ll learn how to branch and chain activities, create custom activities, and schedule pipelines. This book will help you to discover the benefits of cloud data warehousing, Azure Synapse Analytics, and Azure Data Lake Gen2 Storage, which are frequently used for big data analytics. With practical recipes, you’ll learn how to actively engage with analytical tools from Azure Data Services and leverage your on-premise infrastructure with cloud-native tools to get relevant business insights. As you advance, you’ll be able to integrate the most commonly used Azure Services into ADF and understand how Azure services can be useful in designing ETL pipelines. The book will take you through the common errors that you may encounter while working with ADF and show you how to use the Azure portal to monitor pipelines. You’ll also understand error messages and resolve problems in connectors and data flows with the debugging capabilities of ADF. By the end of this book, you’ll be able to use ADF as the main ETL and orchestration tool for your data warehouse or data platform projects. 
What you will learn Create an orchestration and transformation job in ADF Develop, execute, and monitor data flows using Azure Synapse Create big data pipelines using Azure Data Lake and ADF Build a machine learning app with Apache Spark and ADF Migrate on-premises SSIS jobs to ADF Integrate ADF with commonly used Azure services such as Azure ML, Azure Logic Apps, and Azure Functions Run big data compute jobs within HDInsight and Azure Databricks Copy data from AWS S3 and Google Cloud Storage to Azure Storage using ADF's built-in connectors Who this book is for This book is for ETL developers, data warehouse and ETL architects, software professionals, and anyone who wants to learn about the common and not-so-common challenges faced while developing traditional and hybrid ETL solutions using Microsoft's Azure Data Factory. You’ll also find this book useful if you are looking for recipes to improve or enhance your existing ETL pipelines. Basic knowledge of data warehousing is expected.
Download or read book Power Query Cookbook written by Andrea Janicijevic and published by Packt Publishing Ltd. This book was released on 2021-10-15 with total page 412 pages. Available in PDF, EPUB and Kindle. Book excerpt: Leverage your source data from hundreds of different connections, perform millions of different transformations, and easily manage highly complex data lifecycles with Power Query Key Features Collect, combine, and transform data using Power Query's data connectivity and data preparation features Overcome the problems faced while accessing data from multiple sources and reshape it to meet your business requirements Explore how the M language can be used to write your own customized solutions Book Description Power Query is a data preparation tool that enables data engineers and business users to connect, reshape, enrich, and transform their data to facilitate relevant business insights and analysis. With Power Query's wide range of features, you can perform no-code transformations and complex M code functions at the same time to get the most out of your data. This Power Query book will help you to connect to data sources, achieve intuitive transformations, and get to grips with preparation practices. Starting with a general overview of Power Query and what it can do, the book advances to cover more complex topics such as M code and performance optimization. You'll learn how to extend these capabilities by gradually stepping away from the Power Query GUI and into the M programming language. Additionally, the book also shows you how to use Power Query Online within Power BI Dataflows. By the end of the book, you'll be able to leverage your source data, understand your data better, and enrich it with a full stack of no-code and custom features that you'll learn to design by yourself for your business requirements. 
What you will learn Understand how to use Power Query to connect and explore data Explore ways to reshape and enrich data Discover the potential of Power Query across the Microsoft platform Build complex and custom transformations Use M code to write new queries against data sources Use the Power Query Online tool within Power BI Dataflows Implement best practices such as reusing dataflows, optimizing expanding table operations, and field mapping Who this book is for This book is for data analysts, BI developers, data engineers, and anyone looking for a desk reference guide to learn how Power Query can be used with different Microsoft products to handle data of varying complexity. Beginner-level knowledge of Power BI and the M Language will help you to get the best out of this book.
Download or read book Apache Sqoop Cookbook written by Kathleen Ting and published by "O'Reilly Media, Inc.". This book was released on 2013-07-02 with total page 95 pages. Available in PDF, EPUB and Kindle. Book excerpt: Integrating data from multiple sources is essential in the age of big data, but it can be a challenging and time-consuming task. This handy cookbook provides dozens of ready-to-use recipes for using Apache Sqoop, the command-line interface application that optimizes data transfers between relational databases and Hadoop. Sqoop is both powerful and bewildering, but with this cookbook’s problem-solution-discussion format, you’ll quickly learn how to deploy and then apply Sqoop in your environment. The authors provide MySQL, Oracle, and PostgreSQL database examples on GitHub that you can easily adapt for SQL Server, Netezza, Teradata, or other relational systems. Transfer data from a single database table into your Hadoop ecosystem Keep table data and Hadoop in sync by importing data incrementally Import data from more than one database table Customize transferred data by calling various database functions Export generated, processed, or backed-up data from Hadoop to your database Run Sqoop within Oozie, Hadoop’s specialized workflow scheduler Load data into Hadoop’s data warehouse (Hive) or database (HBase) Handle installation, connection, and syntax issues common to specific database vendors
Download or read book Tableau Prep Cookbook written by Hendrik Kleine and published by Packt Publishing Ltd. This book was released on 2021-03-19 with total page 289 pages. Available in PDF, EPUB and Kindle. Book excerpt: Explore common and not-so-common data transformation scenarios and solutions to become well-versed with Tableau Prep and create efficient and powerful data pipelines Key Features Combine, clean, and shape data for analysis using self-service data preparation techniques Become proficient with Tableau Prep for building and managing data flows across your organization Learn how to combine multiple data transformations in order to build a robust dataset Book Description Tableau Prep is a tool in the Tableau software suite, created specifically to develop data pipelines. This book will describe, in detail, a variety of scenarios that you can apply in your environment for developing, publishing, and maintaining complex Extract, Transform and Load (ETL) data pipelines. The book starts by showing you how to set up Tableau Prep Builder. You'll learn how to obtain data from various data sources, including files, databases, and Tableau Extracts. Next, the book demonstrates how to perform data cleaning and data aggregation in Tableau Prep Builder. You'll also gain an understanding of Tableau Prep Builder and how you can leverage it to create data pipelines that prepare your data for downstream analytics processes, including reporting and dashboard creation in Tableau. As part of a Tableau Prep flow, you'll also explore how to use R and Python to implement data science components inside a data pipeline. In the final chapter, you'll apply the knowledge you've gained to build two use cases from scratch, including a data flow for a retail store to prepare a robust dataset using multiple disparate sources and a data flow for a call center to perform ad hoc data analysis. 
By the end of this book, you'll be able to create, run, and publish Tableau Prep flows and implement solutions to common problems in data pipelines. What you will learn Perform data cleaning and preparation techniques for advanced data analysis Understand how to combine multiple disparate datasets Prepare data for different Business Intelligence (BI) tools Apply Tableau Prep's calculation language to create powerful calculations Use Tableau Prep for ad hoc data analysis and data science flows Deploy Tableau Prep flows to Tableau Server and Tableau Online Who this book is for This book is for business intelligence professionals, data analysts, and Tableau users looking to learn Tableau Prep essentials and create data pipelines or ETL processes using it. Beginner-level knowledge of data management will be beneficial to understand the concepts covered in this Tableau cookbook more effectively.
Download or read book Chicano Eats written by Esteban Castillo and published by HarperCollins. This book was released on 2020-06-30 with total page 478 pages. Available in PDF, EPUB and Kindle. Book excerpt: The winner of the Saveur Best New Voice People’s Choice Award takes us on a delicious tour through the diverse flavors and foods of Chicano cuisine. Growing up among the Latino population of Santa Ana, California, Esteban Castillo was inspired to create the blog, Chicano Eats, to showcase his love for design, cooking, and culture and provide a space for authentic Latino voices, recipes, and stories to be heard. Building on his blog, this bicultural cookbook includes eighty-five traditional and fusion Mexican recipes—as gorgeous to look at as they are sublime to eat. Chicano cuisine is Mexican food made by Chicanos (Mexican Americans) that has been shaped by the communities in the U.S. where they grew up. It is Mexican food that bisects borders and uses a group of traditional ingredients—chiles, beans, tortillas, corn, and tomatillos—and techniques while boldly incorporating many exciting new twists, local ingredients, and influences from other cultures and regions in the United States. Chicano Eats is packed with easy, flavorful recipes such as: Chicken con Chochoyotes (Chicken and Corn Masa Dumplings) Mac and Queso Fundido Birria (Beef Stew with a Guajillo Chile Broth) Toasted Coconut Horchata Chorizo-Spiced Squash Tacos Champurrado Chocolate Birthday Cake (Inspired by the Mexican drink made with milk and chocolate and thickened with corn masa) Cherry Lime Chia Agua Fresca Accompanied by more than 100 bright, modern photographs, Chicano Eats is a melting pot of delicious and nostalgic recipes, a literal blending of cultures through food that offer a taste of home for Latinos and introduces familiar flavors and ingredients in a completely different and original way for Americans of all ethnic heritages.
Download or read book Qlik Sense Cookbook written by Pablo Labbe and published by Packt Publishing Ltd. This book was released on 2018-08-30 with total page 326 pages. Available in PDF, EPUB and Kindle. Book excerpt: Create dynamic dashboards to perform interactive analytics for business intelligence operations Key Features Explore newly added features in Qlik Sense Discover best practices to work with data using Qlik Sense Learn to implement advanced functions for better data insight Book Description Qlik Sense allows you to explore simple and complex data to reveal hidden insight and data relationships that help you make quality decisions for overall productivity. An expert Qlik Sense user can use its features for business intelligence in an enterprise environment effectively. Qlik Sense Cookbook is an excellent guide for all aspiring Qlik Sense developers and will empower you to create featured desktop applications to obtain daily insights at work. This book takes you through the basics and advanced functions of Qlik Sense February 2018 release. You’ll start with a quick refresher on obtaining data from data files and databases, and move on to some more refined features including visualization, and scripting, as well as managing apps and user interfaces. You will then understand how to work with advanced functions like set analysis and set expressions. As you make your way through this book, you will uncover newly added features in Qlik Sense such as new visualizations, label expressions and colors for dimension and measures. By the end of this book, you will have explored various visualization extensions to create your own interactive dashboard with the required tips and tricks. This will help you overcome challenging situations while developing your applications in Qlik Sense. 
What you will learn Source, preview, and distribute your data through interactive dashboards Explore and work with the latest visualization functions Learn how to write and use script subroutines Make your UI advanced and intuitive with custom objects and indicators Use visualization extensions for your Qlik Sense dashboard Work with Aggr and learn to use it within set analysis Who this book is for Qlik Sense Cookbook is for data and BI analysts who want to become well versed with Qlik Sense to apply business intelligence in data. If you are a beginner in data analytics and want to adopt an independent recipe-based approach to learn the required concepts and services in detail, this book is ideal! Individuals with prior knowledge of its sister product, QlikView, will also benefit from this book. Familiarity with the basics of business intelligence is a prerequisite.
Download or read book Optimizing Transact SQL written by David Rozenshtein and published by . This book was released on 1995 with total page 132 pages. Available in PDF, EPUB and Kindle. Book excerpt: Encoded characteristic functions (ECF) is a new, innovative SQL programming methodology which allows programmers to encode conditional logic as scalar expressions within certain clauses. These extremely powerful techniques are presented by the authors of ECF in "Optimizing Transact-SQL".
Download or read book Azure Synapse Analytics Cookbook written by Gaurav Agarwal and published by Packt Publishing Ltd. This book was released on 2022-04-29 with total page 238 pages. Available in PDF, EPUB and Kindle. Book excerpt: Whether you're an Azure veteran or just getting started, get the most out of your data with effective recipes for Azure Synapse Key Features Discover new techniques for using Azure Synapse, regardless of your level of expertise Integrate Azure Synapse with other data sources to create a unified experience for your analytical needs using Microsoft Azure Learn how to embed data governance and classification with Synapse Analytics by integrating Azure Purview Book Description As data warehouse management becomes increasingly integral to successful organizations, choosing and running the right solution is more important than ever. Microsoft Azure Synapse is an enterprise-grade, cloud-based data warehousing platform, and this book holds the key to using Synapse to its full potential. If you want the skills and confidence to create a robust enterprise analytical platform, this cookbook is a great place to start. You'll learn and execute enterprise-level deployments on medium-to-large data platforms. Using the step-by-step recipes and accompanying theory covered in this book, you'll understand how to integrate various services with Synapse to make it a robust solution for all your data needs. Whether you're new to Azure Synapse or just getting started, you'll find the instructions you need to solve any problem you may face, including using Azure services for data visualization as well as for artificial intelligence (AI) and machine learning (ML) solutions. By the end of this Azure book, you'll have the skills you need to implement an enterprise-grade analytical platform, enabling your organization to explore and manage heterogeneous data workloads and employ various data integration services to solve real-time industry problems. 
What you will learn Discover the optimal approach for loading and managing data Work with notebooks for various tasks, including ML Run real-time analytics using Azure Synapse Link for Cosmos DB Perform exploratory data analytics using Apache Spark Read and write DataFrames into Parquet files using PySpark Create reports on various metrics for monitoring key KPIs Combine Power BI and Serverless for distributed analysis Enhance your Synapse analysis with data visualizations Who this book is for This book is for data architects, data engineers, and developers who want to learn and understand the main concepts of Azure Synapse analytics and implement them in real-world scenarios.
Download or read book Introducing Microsoft Power BI written by Alberto Ferrari and published by Microsoft Press. This book was released on 2016-07-07 with total page 453 pages. Available in PDF, EPUB and Kindle. Book excerpt: This is the eBook of the printed book and may not include any media, website access codes, or print supplements that may come packaged with the bound book. Introducing Microsoft Power BI enables you to evaluate when and how to use Power BI. Get inspired to improve business processes in your company by leveraging the available analytical and collaborative features of this environment. Be sure to watch for the publication of Alberto Ferrari and Marco Russo's upcoming retail book, Analyzing Data with Power BI and Power Pivot for Excel (ISBN 9781509302765). Go to the book's page at the Microsoft Press Store here for more details: http://aka.ms/analyzingdata/details. Learn more about Power BI at https://powerbi.microsoft.com/.
Download or read book Learning Data Mining with Python written by Robert Layton and published by Packt Publishing Ltd. This book was released on 2015-07-29 with total page 344 pages. Available in PDF, EPUB and Kindle. Book excerpt: The next step in the information age is to gain insights from the deluge of data coming our way. Data mining provides a way of finding this insight, and Python is one of the most popular languages for data mining, providing both power and flexibility in analysis. This book teaches you to design and develop data mining applications using a variety of datasets, starting with basic classification and affinity analysis. Next, we move on to more complex data types including text, images, and graphs. In every chapter, we create models that solve real-world problems. There is a rich and varied set of libraries available in Python for data mining. This book covers a large number, including the IPython Notebook, pandas, scikit-learn and NLTK. Each chapter of this book introduces you to new algorithms and techniques. By the end of the book, you will gain a large insight into using Python for data mining, with a good knowledge and understanding of the algorithms and implementations.
Download or read book Mastering Microsoft Power BI written by Brett Powell and published by Packt Publishing Ltd. This book was released on 2018-03-29 with total page 632 pages. Available in PDF, EPUB and Kindle. Book excerpt: Design, create and manage robust Power BI solutions to gain meaningful business insights Key Features Master all the dashboarding and reporting features of Microsoft Power BI Combine data from multiple sources, create stunning visualizations and publish your reports across multiple platforms A comprehensive guide with real-world use cases and examples demonstrating how you can get the best out of Microsoft Power BI Book Description This book is intended for business intelligence professionals responsible for the design and development of Power BI content as well as managers, architects and administrators who oversee Power BI projects and deployments. The chapters flow from the planning of a Power BI project through the development and distribution of content to the administration of Power BI for an organization. BI developers will learn how to create sustainable and impactful Power BI datasets, reports, and dashboards. This includes connecting to data sources, shaping and enhancing source data, and developing an analytical data model. Additionally, top report and dashboard design practices are described using features such as Bookmarks and the Power KPI visual. BI managers will learn how Power BI’s tools work together such as with the On-premises data gateway and how content can be staged and securely distributed via Apps. Additionally, both the Power BI Report Server and Power BI Premium are reviewed. 
By the end of this book, you will be confident in creating effective charts, tables, reports or dashboards for any kind of data using the tools and techniques in Microsoft Power BI. What you will learn Build efficient data retrieval and transformation processes with the Power Query M Language Design scalable, user-friendly DirectQuery and Import Data Models Develop visually rich, immersive, and interactive reports and dashboards Maintain version control and stage deployments across development, test, and production environments Manage and monitor the Power BI Service and the On-premises data gateway Develop a fully on-premise solution with the Power BI Report Server Scale up a Power BI solution via Power BI Premium capacity and migration to Azure Analysis Services or SQL Server Analysis Services Who this book is for Business Intelligence professionals and existing Power BI users looking to master Power BI for all their data visualization and dashboarding needs will find this book to be useful. While understanding of the basic BI concepts is required, some exposure to Microsoft Power BI will be helpful.
Download or read book Learning Apache Spark 2 written by Muhammad Asif Abbasi and published by Packt Publishing Ltd. This book was released on 2017-03-28 with total page 349 pages. Available in PDF, EPUB and Kindle. Book excerpt: Learn about the fastest-growing open source project in the world, and find out how it revolutionizes big data analytics About This Book Exclusive guide that covers how to get up and running with fast data processing using Apache Spark Explore and exploit various possibilities with Apache Spark using real-world use cases in this book Want to perform efficient data processing at real time? This book will be your one-stop solution. Who This Book Is For This guide appeals to big data engineers, analysts, architects, software engineers, even technical managers who need to perform efficient data processing on Hadoop at real time. Basic familiarity with Java or Scala will be helpful. The assumption is that readers will be from a mixed background, but would be typically people with background in engineering/data science with no prior Spark experience and want to understand how Spark can help them on their analytics journey. What You Will Learn Get an overview of big data analytics and its importance for organizations and data professionals Delve into Spark to see how it is different from existing processing platforms Understand the intricacies of various file formats, and how to process them with Apache Spark. Realize how to deploy Spark with YARN, MESOS or a Stand-alone cluster manager. Learn the concepts of Spark SQL, SchemaRDD, Caching and working with Hive and Parquet file formats Understand the architecture of Spark MLLib while discussing some of the off-the-shelf algorithms that come with Spark. Introduce yourself to the deployment and usage of SparkR. 
Walk through the importance of Graph computation and the graph processing systems available in the market Check the real world example of Spark by building a recommendation engine with Spark using ALS. Use a Telco data set, to predict customer churn using Random Forests. In Detail Spark juggernaut keeps on rolling and getting more and more momentum each day. Spark provides key capabilities in the form of Spark SQL, Spark Streaming, Spark ML and Graph X all accessible via Java, Scala, Python and R. Deploying the key capabilities is crucial whether it is on a Standalone framework or as a part of existing Hadoop installation and configuring with Yarn and Mesos. The next part of the journey after installation is using key components, APIs, Clustering, machine learning APIs, data pipelines, parallel programming. It is important to understand why each framework component is key, how widely it is being used, its stability and pertinent use cases. Once we understand the individual components, we will take a couple of real life advanced analytics examples such as 'Building a Recommendation system', 'Predicting customer churn' and so on. The objective of these real life examples is to give the reader confidence of using Spark for real-world problems. Style and approach With the help of practical examples and real-world use cases, this guide will take you from scratch to building efficient data applications using Apache Spark. You will learn all about this excellent data processing engine in a step-by-step manner, taking one aspect of it at a time. This highly practical guide will include how to work with data pipelines, dataframes, clustering, SparkSQL, parallel programming, and such insightful topics with the help of real-world use cases.
Download or read book Mastering Tableau 2021 written by Marleen Meier and published by Packt Publishing Ltd. This book was released on 2021-05-31 with total page 793 pages. Available in PDF, EPUB and Kindle. Book excerpt: Build, design, and improve advanced business intelligence solutions using Tableau's latest features, including Tableau Prep Builder, Tableau Hyper, and Tableau Server. Key Features Master new features in Tableau 2021 to solve real-world analytics challenges. Perform geo-spatial, time series, and self-service analytics using real-life examples. Build and publish dashboards and explore storytelling using Python and R integration support. Book Description Tableau is one of the leading business intelligence (BI) tools that can help you solve data analysis challenges. With this book, you will master Tableau's features and offerings in various paradigms of the BI domain. Updated with fresh topics including Quick Level of Detail expressions, the newest Tableau Server features, Einstein Discovery, and more, this book covers essential Tableau concepts and advanced functionalities. Leveraging Tableau Hyper files and using Prep Builder, you'll be able to perform data preparation and handling easily. You'll gear up to perform complex joins, spatial joins, unions, and data blending tasks using practical examples. Next, you'll learn how to execute data densification and further explore expert-level examples to help you with calculations, mapping, and visual design using Tableau extensions. You'll also learn about improving dashboard performance, connecting to Tableau Server and understanding data visualization with examples. Finally, you'll cover advanced use cases such as self-service analysis, time series analysis, and geo-spatial analysis, and connect Tableau to Python and R to implement programming functionalities within it. 
By the end of this Tableau book, you'll have mastered the advanced offerings of Tableau 2021 and be able to tackle common and advanced challenges in the BI domain. What you will learn Get up to speed with various Tableau components. Master data preparation techniques using Tableau Prep Builder. Discover how to use Tableau to create a PowerPoint-like presentation. Understand different Tableau visualization techniques and dashboard designs. Interact with Tableau Server to understand its architecture and functionalities. Study advanced visualizations and dashboard creation techniques. Brush up on powerful self-service analytics, time series analytics, and geo-spatial analytics. Who this book is for This book is designed for business analysts, business intelligence professionals and data analysts who want to master Tableau to solve a range of data science and business intelligence problems. The book is ideal if you have a good understanding of Tableau and want to take your skills to the next level.
Download or read book Getting Started with Oracle Data Integrator 11g written by David Hecksel and published by Packt Publishing Ltd. This book was released on 2012-01-01 with total page 503 pages. Available in PDF, EPUB and Kindle. Book excerpt: Combine high volume data movement, complex transformations and real-time data integration with the robust capabilities of ODI in this practical guide.