Summary The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In Spark in Action, Second Edition, you’ll learn to take advantage of Spark’s core features and incredible processing speed, with applications including real-time computation, delayed evaluation, and machine learning. Spark skills are a hot commodity in enterprises worldwide, and with Spark’s powerful and flexible Java APIs, you can reap all the benefits without first learning Scala or Hadoop. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the technology Analyzing enterprise data starts by reading, filtering, and merging files and streams from many sources. The Spark data processing engine handles this varied volume like a champ, delivering speeds 100 times faster than Hadoop systems. Thanks to SQL support, an intuitive interface, and a straightforward multilanguage API, you can use Spark without learning a complex new ecosystem. About the book Spark in Action, Second Edition, teaches you to create end-to-end analytics applications. In this entirely new book, you’ll learn from interesting Java-based examples, including a complete data pipeline for processing NASA satellite data. And you’ll discover Java, Python, and Scala code samples hosted on GitHub that you can explore and adapt, plus appendixes that give you a cheat sheet for installing tools and understanding Spark-specific terms. What's inside Writing Spark applications in Java Spark application architecture Ingestion through files, databases, streaming, and Elasticsearch Querying distributed datasets with Spark SQL About the reader This book does not assume previous experience with Spark, Scala, or Hadoop. About the author Jean-Georges Perrin is an experienced data and software architect. He is France’s first IBM Champion and has been honored for 12 consecutive years. Table of Contents PART 1 - THE THEORY CRIPPLED BY AWESOME EXAMPLES 1 So, what is Spark, anyway? 2 Architecture and flow 3 The majestic role of the dataframe 4 Fundamentally lazy 5 Building a simple app for deployment 6 Deploying your simple app PART 2 - INGESTION 7 Ingestion from files 8 Ingestion from databases 9 Advanced ingestion: finding data sources and building your own 10 Ingestion through structured streaming PART 3 - TRANSFORMING YOUR DATA 11 Working with SQL 12 Transforming your data 13 Transforming entire documents 14 Extending transformations with user-defined functions 15 Aggregating your data PART 4 - GOING FURTHER 16 Cache and checkpoint: Enhancing Spark’s performances 17 Exporting data and building full data pipelines 18 Exploring deployment
✔ Author(s): Jean-Georges Perrin
✔ Title: Spark in Action, Second Edition: Covers Apache Spark 3 with Examples in Java, Python, and Scala
✔ Rating : 4.5 out of 5 base on (41 reviews)
✔ ISBN-10: 1617295523
✔ Language: English
✔ Format ebook: PDF, EPUB, Kindle, Audio, HTML and MOBI
✔ Device compatibles: Android, iOS, PC and Amazon Kindle
Readers' opinions about Spark in Action, Second Edition by Jean-Georges Perrin
Sara Poole
Travel back in time with a historical epic that vividly recreates a bygone era. The author's meticulous research and engaging prose transport you to another world. Complex characters and intricate plots keep you enthralled from beginning to end. Each chapter reveals new insights into the period's culture and society. It's a captivating blend of history and fiction. Ideal for history buffs and lovers of epic sagas.
Goldarina Wilson
Discover the poignant story of a family navigating life's ups and downs in this moving novel. The author's empathetic writing and well-drawn characters create a deeply emotional experience. Each chapter explores themes of love, loss, and resilience with sensitivity. The plot's twists and turns keep you engaged throughout. It's a heartwarming and thought-provoking read. Perfect for readers who enjoy stories about family dynamics.
Alicia Lawrence
Experience the inspiring journey of an individual overcoming incredible odds in this powerful memoir. The author's candid and heartfelt writing brings their story to life. Each chapter reveals the resilience and strength of the human spirit. The narrative is both informative and deeply moving, offering valuable life lessons. It's a story that motivates and inspires, making it a must-read. Perfect for those seeking inspiration and personal growth.
Human Relations in Organizations: Applications and Skill Building, The Killing Hills (The Mick Hardin Novels, 1), Hiking West Virginia: A Guide to the State’s Greatest Hiking Adventures (State Hiking Guides Series), THE FIRE TOWER (The “Hanna and Alex” Low Country Mystery and Suspense Series.), Financial Statements, Third Edition: A Step-by-Step Guide to Understanding and Creating Financial Reports (Over 200,000 copies sold!), With The Heart In Mind, Mass Media and American Politics, Official TOEFL iBT Tests Volume 2, Second Edition, Ballad of a Sober Man: An ER Doctor’s Journey of Recovery, Point of Danger: (A Clean Contemporary Romantic Suspense Thriller) (Triple Threat),