<?xml version="1.0" encoding="UTF-8"?>
<mods xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://www.loc.gov/mods/v3" version="3.1" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-1.xsd">
  <titleInfo>
    <title>Learn PySpark</title>
    <subTitle>build python-based machine learning and deep learning models</subTitle>
  </titleInfo>
  <name type="personal">
    <namePart>Singh, Pramod.</namePart>
    <role>
      <roleTerm authority="marcrelator" type="text">creator</roleTerm>
    </role>
  </name>
  <name type="corporate">
    <namePart>SpringerLink (Online service)</namePart>
  </name>
  <name type="corporate">
    <namePart>O'Reilly (Firm)</namePart>
  </name>
  <name type="corporate">
    <namePart>Serials Solutions</namePart>
  </name>
  <typeOfResource>text</typeOfResource>
  <genre authority="local">Electronic books.</genre>
  <originInfo>
    <place>
      <placeTerm type="code" authority="marccountry">xxu</placeTerm>
    </place>
    <place>
      <placeTerm type="text">New York</placeTerm>
    </place>
    <publisher>Apress</publisher>
    <dateIssued>2019</dateIssued>
    <edition>1st ed. </edition>
    <issuance>monographic</issuance>
  </originInfo>
  <language>
    <languageTerm authority="iso639-2b" type="code">eng</languageTerm>
  </language>
  <physicalDescription>
    <extent>xviii,210p. ; 23 cm.</extent>
  </physicalDescription>
  <abstract>Leverage machine and deep learning models to build applications on real-time data using PySpark. This book is perfect for those who want to learn to use this language to perform exploratory data analysis and solve an array of business challenges. You'll start by reviewing PySpark fundamentals, such as Spark's core architecture, and see how to use PySpark for big data processing like data ingestion, cleaning, and transformations techniques. This is followed by building workflows for analyzing streaming data using PySpark and a comparison of various streaming platforms. You'll then see how to schedule different spark jobs using Airflow with PySpark and book examine tuning machine and deep learning models for real-time predictions. This book concludes with a discussion on graph frames and performing network analysis using graph algorithms in PySpark. All the code presented in the book will be available in Python scripts on Github.</abstract>
  <tableOfContents>Chapter 1: Introduction to PySpark -- Chapter 2: Data Processing -- Chapter 3: Spark Structured Streaming -- Chapter 4: Airflow -- Chapter 5: Machine Learning Library (MLlib) -- Chapter 6: Supervised Machine Learning -- Chapter 7: Unsupervised Machine Learning -- Chapter 8: Deep Learning Using PySpark.</tableOfContents>
  <note type="statement of responsibility">By Pramod Singh.</note>
  <note>Requires an SPL library card.</note>
  <note>Mode of access: World Wide Web.</note>
  <subject authority="lcsh">
    <topic>Python (Computer program language)</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Big data</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Machine learning</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Open source software</topic>
  </subject>
  <subject authority="lcsh">
    <topic>Computer programming</topic>
  </subject>
  <classification authority="lcc">QA76.73.P98</classification>
  <classification authority="ddc" edition="1">005.76821 SINP</classification>
  <relatedItem type="host">
    <titleInfo>
      <title>Springer eBooks</title>
    </titleInfo>
  </relatedItem>
  <relatedItem type="otherFormat" displayLabel="Printed edition:"/>
  <relatedItem type="otherFormat" displayLabel="Printed edition:"/>
  <identifier type="isbn">9781484249604 </identifier>
  <identifier type="uri">https://ezproxy.spl.org/login?url=https://learning.oreilly.com/library/view/~/9781484249611/?ar</identifier>
  <location>
    <url displayLabel="View this electronic item in O'Reilly Online Learning: Academic/Public Library Edition.">https://ezproxy.spl.org/login?url=https://learning.oreilly.com/library/view/~/9781484249611/?ar</url>
  </location>
  <accessCondition type="restrictionOnAccess">Requires an SPL library card.</accessCondition>
  <recordInfo>
    <recordContentSource authority="marcorg"/>
    <recordCreationDate encoding="marc">190906</recordCreationDate>
    <recordChangeDate encoding="iso8601">20250110114257.0</recordChangeDate>
    <recordIdentifier source="WaSeSS">ssj0002239821</recordIdentifier>
  </recordInfo>
</mods>
