Repo: Nebius, published Sep 2, 2024

nebius/nebius-connect

Language: Python

License: Apache-2.0

Stars: 10

Forks: 0

Open issues: 1

Created: 2024-09-02T10:00:39Z

Pushed: 2025-06-16T07:16:51Z

Default branch: main

Fork: no

Archived: yes

README:

Nebius AI connector for Apache Spark™

The Managed Service for Apache Spark, a Nebius AI service, offers access to _sessions:_ managed environments that can handle multiple independent ad-hoc computations at the same time.

With this connector, you can connect to your Managed Spark sessions using Spark Connect and process data with Spark APIs from your machine.

Installing

pip install nebius-connect
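
To confirm that the package landed in the active environment, one option is to query the installed distribution metadata with the standard library. This is a minimal sketch, not part of the connector: `installed_version` is a hypothetical helper name, and the only assumption about the package is the distribution name `nebius-connect` from the install command above.

```python
from importlib import metadata


def installed_version(distribution="nebius-connect"):
    """Return the installed version string, or None if it is not installed."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


version = installed_version()
print("nebius-connect:", version if version else "not installed")
```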

Example

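from pyspark.sql.connect.session import SparkSession
from nebius.spark.connect import create_channel_builder

# Build a channel to the session endpoint.
# Replace the endpoint and password with your own session's values.
nebius_spark_cb = create_channel_builder(
    'spsession-example123.nebius.cloud:443',
    password='my-password'
)

# Open a Spark Connect session over that channel.
spark = SparkSession \
    .builder \
    .channelBuilder(nebius_spark_cb) \
    .getOrCreate()

# Create a small DataFrame on the remote session and print it.
columns = ["id", "name"]
data = [(1, "Sarah"), (2, "Maria")]
df = spark.createDataFrame(data).toDF(*columns)
df.show()

# Close the client session when done.
spark.stop()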
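
The example above hardcodes the password and only reaches `spark.stop()` if nothing fails earlier. One possible variation, sketched here under stated assumptions, reads the connection settings from the environment and wraps the session in a context manager so it is stopped even when an error occurs. The environment variable names `NEBIUS_SPARK_ENDPOINT` and `NEBIUS_SPARK_PASSWORD` and the helpers `load_connection_settings`, `nebius_session`, and `main` are hypothetical and not defined by the connector. Only `create_channel_builder` and the `SparkSession` builder calls come from the example above.

```python
import os
from contextlib import contextmanager

# Hypothetical variable names chosen for this sketch; the connector
# itself does not read them.
ENDPOINT_VAR = "NEBIUS_SPARK_ENDPOINT"
PASSWORD_VAR = "NEBIUS_SPARK_PASSWORD"


def load_connection_settings(env=None):
    """Return (endpoint, password) from a mapping, defaulting to os.environ."""
    env = os.environ if env is None else env
    missing = [name for name in (ENDPOINT_VAR, PASSWORD_VAR) if not env.get(name)]
    if missing:
        raise RuntimeError("missing environment variables: " + ", ".join(missing))
    return env[ENDPOINT_VAR], env[PASSWORD_VAR]


@contextmanager
def nebius_session(endpoint, password):
    """Yield a Spark Connect session and stop it on exit, even on error."""
    # Imported lazily so the settings helper works without pyspark installed.
    from pyspark.sql.connect.session import SparkSession
    from nebius.spark.connect import create_channel_builder

    channel_builder = create_channel_builder(endpoint, password=password)
    spark = SparkSession.builder.channelBuilder(channel_builder).getOrCreate()
    try:
        yield spark
    finally:
        spark.stop()


def main():
    """Run the README example using settings taken from the environment."""
    endpoint, password = load_connection_settings()
    with nebius_session(endpoint, password) as spark:
        df = spark.createDataFrame([(1, "Sarah"), (2, "Maria")]).toDF("id", "name")
        df.show()
```

With both variables exported in the shell, calling `main()` runs the same two-row example without a password in the source file.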

License

Copyright 2024 Nebius B.V.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

_Apache and Apache Spark are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries._
