Capability
Pandas Api On Spark For Familiar Dataframe Operations At Scale
5 artifacts provide this capability.
Want a personalized recommendation?
Find the best match →Top Matches
Unified engine for large-scale data processing and ML.
Unique: Pandas API on Spark translates Pandas operations to Spark SQL/DataFrame operations, enabling code portability without rewriting — a compatibility layer enabling gradual migration from Pandas to Spark
vs others: More familiar to Pandas users than native Spark API; enables code reuse without rewriting; slower than native Spark API but faster than single-machine Pandas for large datasets