mkdocs/docs/contributing.md (16 additions, 0 deletions)
@@ -58,6 +58,22 @@ For IDEA ≤2021 you need to install the [Poetry integration as a plugin](https:
Now you're set using Poetry, and all the tests will run in Poetry, and you'll have syntax highlighting in the pyproject.toml to indicate stale dependencies.
| sql-postgres | Support for SQL Catalog backed by PostgreSQL |
| sql-sqlite | Support for SQL Catalog backed by SQLite |
| pyarrow | PyArrow as a FileIO implementation to interact with the object store |
| pandas | Installs both PyArrow and Pandas |
| duckdb | Installs both PyArrow and DuckDB |
| ray | Installs PyArrow, Pandas, and Ray |
| s3fs | S3FS as a FileIO implementation to interact with the object store |
| adlfs | ADLFS as a FileIO implementation to interact with the object store |
| snappy | Support for snappy Avro compression |
| gcs | GCS as a FileIO implementation to interact with the object store |

You need to install one of `s3fs`, `adlfs`, `gcs`, or `pyarrow` to be able to fetch files from an object store.
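
To check which of these object-store options is present in the current environment, a small standard-library sketch like the one below may help. The mapping from extra name to importable module (in particular `gcs` to `gcsfs`) and the helper name `available_fileio_extras` are assumptions made for illustration, not part of the documented API.

```python
from importlib.util import find_spec

# Extra name -> top-level module it is assumed to install.
# These module names are assumptions, not taken from the docs above.
FILEIO_EXTRAS = {
    "s3fs": "s3fs",
    "adlfs": "adlfs",
    "gcs": "gcsfs",
    "pyarrow": "pyarrow",
}


def available_fileio_extras(is_installed=None):
    """Return the extras whose backing module appears to be importable.

    `is_installed` is a hypothetical hook (module name -> bool) so the
    check can be exercised without touching the real environment.
    """
    if is_installed is None:
        def is_installed(module):
            # find_spec returns None when a top-level module is missing.
            return find_spec(module) is not None
    return [extra for extra, module in FILEIO_EXTRAS.items() if is_installed(module)]


if __name__ == "__main__":
    found = available_fileio_extras()
    if found:
        print("Object-store FileIO available via:", ", ".join(found))
    else:
        print("No object-store FileIO found; install one of:", ", ".join(FILEIO_EXTRAS))
```

If the list comes back empty, installing any one of the four extras from the table should be enough to fetch files from an object store.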
## Connecting to a catalog
Iceberg leverages the [catalog to have one centralized place to organize the tables](https://iceberg.apache.org/catalog/). This can be a traditional Hive catalog to store your Iceberg tables next to the rest, a vendor solution like the AWS Glue catalog, or an implementation of Iceberg's own [REST protocol](https://github.com/apache/iceberg/tree/main/open-api). Check out the [configuration](configuration.md) page to find all the configuration details.
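
As a rough illustration of connecting to a REST catalog, the sketch below builds a properties dictionary and passes it to `load_catalog` from `pyiceberg.catalog`. The helper names `rest_catalog_properties` and `connect`, the catalog name, and the `http://localhost:8181` endpoint are hypothetical; the configuration page remains the reference for the supported keys.

```python
def rest_catalog_properties(uri, token=None):
    """Build properties for a REST catalog (hypothetical helper).

    Only `type`, `uri`, and an optional `token` are set here; other
    keys are described on the configuration page.
    """
    properties = {"type": "rest", "uri": uri}
    if token is not None:
        properties["token"] = token
    return properties


def connect(name, properties):
    """Load a catalog by name using the given properties.

    The import is deferred so the rest of this sketch can be used
    without the library installed.
    """
    from pyiceberg.catalog import load_catalog

    return load_catalog(name, **properties)


if __name__ == "__main__":
    # Assumed local endpoint, used purely for illustration.
    props = rest_catalog_properties("http://localhost:8181")
    print(props)
    # catalog = connect("default", props)  # needs pyiceberg and a running catalog
```

Keeping the properties in a plain dictionary makes it straightforward to swap the REST settings for a Hive or Glue configuration later.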
## Write a PyArrow dataframe
Let's take the Taxi dataset and write it to an Iceberg table.