.NET for Apache Spark provides high-performance APIs for using Apache Spark from C# and F#. With these .NET APIs, you can access the most popular Dataframe and SparkSQL aspects of Apache Spark, for working with structured data, and Spark Structured Streaming, for working with streaming data. .NET for Apache Spark is compliant with .NET Standard - a formal specification of .NET APIs that are common across .NET implementations. This means you can use .NET for Apache Spark anywhere you write .NET code allowing you to reuse all the knowledge, skills, code, and libraries you already have as a .NET developer. .NET for Apache Spark runs on Windows, Linux, and macOS using .NET Core, or Windows using .NET Framework. It also runs on all major cloud providers including Azure HDInsight Spark, Amazon EMR Spark, AWS & Azure Databricks.

Features

  • Apache Spark™ is a general-purpose distributed processing engine for analytics over large data sets
  • Processing tasks are distributed over a cluster of nodes
  • Data is cached in-memory, to reduce computation time
  • Apache Spark is often used for high-volume data preparation pipelines, such as extract, transform, and load (ETL) processes
  • Large streams of data can be processed in real-time with Apache Spark, such as monitoring streams of sensor data or analyzing financial transactions to detect fraud
  • Apache Spark can reduce the cost and time involved in building machine learning models

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow .NET for Apache Spark

.NET for Apache Spark Web Site

Other Useful Business Software
EHS Software and Management System Icon
EHS Software and Management System

ERA offers the only full EHS&Q platform with advanced automation to drive your complete compliance.

ERA Environmental Software Solutions develops web-based EHS management software for small, medium, and large manufacturers needing to comply with federal, provincial, and state regulations, monitor their air, water, and waste emissions and other environmental outputs, author and manage Safety Data Sheets (SDS) in more than 40 languages, or standardize their Health and Safety procedures for incident and inspection tracking, training delivery, and audit management. The platform also supports comprehensive reporting for programs like TRI, Tier II, Title V, NEI, and NPRI. Companies across the automotive, aerospace, general manufacturing, and paints and coatings industries, to name a few, rely on ERA’s all-in-one, SOC 2 Type II certified SaaS for complete coverage of their EHS needs.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of .NET for Apache Spark!