Skip to content
Scriptorium

Scriptorium

Ideas through Technologies

  • Home
  • Blogs
  • Thoughts
  • Culture & Media
    • Books
    • Movies
    • Movie Archives
  • AI/Cloud
    • AWS Architect
    • AWS Labs
    • Cloud & ML
  • Library
    • Study Notes
    • Tutorials

Tag Archives: Spark

[Spark By Example] Read CSV with Schema

The following sample code (by Python and C#) shows how to read CSV file with schema. You will see how you can specify the schema explicitly.

Continue reading “[Spark By Example] Read CSV with Schema”
Posted byPyongwon LeeNovember 8, 2022November 9, 2022Posted inCloud ComputingTags:PySpark, Python, SparkLeave a comment on [Spark By Example] Read CSV with Schema

[Spark By Example] Read CSV

The following sample code (by Python and C#) shows how to read CSV file without schema.

Continue reading “[Spark By Example] Read CSV”
Posted byPyongwon LeeNovember 8, 2022November 9, 2022Posted inCloud ComputingTags:PySpark, Python, SparkLeave a comment on [Spark By Example] Read CSV

[Spark By Example] Word Count

The following sample code (by Python and C#) shows how to count the word in a text file.

Continue reading “[Spark By Example] Word Count”
Posted byPyongwon LeeNovember 8, 2022November 8, 2022Posted inCloud ComputingTags:PySpark, Python, SparkLeave a comment on [Spark By Example] Word Count

Setting up .NET for Apache Spark

In general, you are developing Spark application using Scala, Python, or R. But do not panic if you are a C# developer. .NET for Apache Spark provides high-level APIs for using Spark from C#.

https://learn.microsoft.com/en-us/dotnet/spark/

Continue reading “Setting up .NET for Apache Spark”
Posted byPyongwon LeeNovember 4, 2022November 7, 2022Posted inCloud ComputingTags:Apache, PySpark, SparkLeave a comment on Setting up .NET for Apache Spark

Install PySpark on Windows

The first step of working with big data is to set up your environment. For learning and testing purposes, you can set the environment in a single machine. Let’s install PySpark on Windows 10 or Windows 11.

Continue reading “Install PySpark on Windows”
Posted byPyongwon LeeNovember 4, 2022December 3, 2022Posted inCloud ComputingTags:Apache, PySpark, SparkLeave a comment on Install PySpark on Windows

Posts navigation

Newer posts 1 2 3
  • LinkedIn
  • Twitter

Search in this blog

Scriptorium, Blog at WordPress.com.
  • Follow Following
    • Scriptorium
    • Join 64 other followers
    • Already have a WordPress.com account? Log in now.
    • Scriptorium
    • Customize
    • Follow Following
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar