| Method | HTTP request | Release Stage |
|---|---|---|
| create | POST /v1/datasets | Stable |
| delete_schema | DELETE /v1/datasets/{datasetRid}/schema | Private Beta |
| get | GET /v1/datasets/{datasetRid} | Stable |
| get_schema | GET /v1/datasets/{datasetRid}/schema | Private Beta |
| read | GET /v1/datasets/{datasetRid}/readTable | Stable |
| replace_schema | PUT /v1/datasets/{datasetRid}/schema | Private Beta |
Creates a new Dataset. A default branch - master for most enrollments - will be created on the Dataset.
| Name | Type | Description | Notes |
|---|---|---|---|
| name | DatasetName | ||
| parent_folder_rid | FolderRid |
Dataset
from foundry_sdk.v1 import FoundryClient
import foundry_sdk
from pprint import pprint
client = FoundryClient(auth=foundry_sdk.UserTokenAuth(...), hostname="example.palantirfoundry.com")
# DatasetName
name = "My Dataset"
# FolderRid
parent_folder_rid = "ri.foundry.main.folder.bfe58487-4c56-4c58-aba7-25defd6163c4"
try:
api_response = client.datasets.Dataset.create(name=name, parent_folder_rid=parent_folder_rid)
print("The create response:\n")
pprint(api_response)
except foundry_sdk.PalantirRPCException as e:
print("HTTP error when calling Dataset.create: %s\n" % e)See README
| Status Code | Type | Description | Content Type |
|---|---|---|---|
| 200 | Dataset | application/json |
[Back to top] [Back to API list] [Back to Model list] [Back to README]
Deletes the Schema from a Dataset and Branch.
| Name | Type | Description | Notes |
|---|---|---|---|
| dataset_rid | DatasetRid | The RID of the Dataset on which to delete the schema. | |
| branch_id | Optional[BranchId] | The ID of the Branch on which to delete the schema. | [optional] |
| preview | Optional[PreviewMode] | [optional] | |
| transaction_rid | Optional[TransactionRid] | The RID of the Transaction on which to delete the schema. | [optional] |
None
from foundry_sdk.v1 import FoundryClient
import foundry_sdk
from pprint import pprint
client = FoundryClient(auth=foundry_sdk.UserTokenAuth(...), hostname="example.palantirfoundry.com")
# DatasetRid | The RID of the Dataset on which to delete the schema.
dataset_rid = None
# Optional[BranchId] | The ID of the Branch on which to delete the schema.
branch_id = None
# Optional[PreviewMode]
preview = True
# Optional[TransactionRid] | The RID of the Transaction on which to delete the schema.
transaction_rid = None
try:
api_response = client.datasets.Dataset.delete_schema(
dataset_rid, branch_id=branch_id, preview=preview, transaction_rid=transaction_rid
)
print("The delete_schema response:\n")
pprint(api_response)
except foundry_sdk.PalantirRPCException as e:
print("HTTP error when calling Dataset.delete_schema: %s\n" % e)See README
| Status Code | Type | Description | Content Type |
|---|---|---|---|
| 204 | None | Schema deleted. | None |
[Back to top] [Back to API list] [Back to Model list] [Back to README]
Gets the Dataset with the given DatasetRid.
| Name | Type | Description | Notes |
|---|---|---|---|
| dataset_rid | DatasetRid |
Dataset
from foundry_sdk.v1 import FoundryClient
import foundry_sdk
from pprint import pprint
client = FoundryClient(auth=foundry_sdk.UserTokenAuth(...), hostname="example.palantirfoundry.com")
# DatasetRid
dataset_rid = "ri.foundry.main.dataset.c26f11c8-cdb3-4f44-9f5d-9816ea1c82da"
try:
api_response = client.datasets.Dataset.get(dataset_rid)
print("The get response:\n")
pprint(api_response)
except foundry_sdk.PalantirRPCException as e:
print("HTTP error when calling Dataset.get: %s\n" % e)See README
| Status Code | Type | Description | Content Type |
|---|---|---|---|
| 200 | Dataset | application/json |
[Back to top] [Back to API list] [Back to Model list] [Back to README]
Retrieves the Schema for a Dataset and Branch, if it exists.
| Name | Type | Description | Notes |
|---|---|---|---|
| dataset_rid | DatasetRid | The RID of the Dataset. | |
| branch_id | Optional[BranchId] | The ID of the Branch. | [optional] |
| preview | Optional[PreviewMode] | [optional] | |
| transaction_rid | Optional[TransactionRid] | The TransactionRid that contains the Schema. | [optional] |
Optional[Any]
from foundry_sdk.v1 import FoundryClient
import foundry_sdk
from pprint import pprint
client = FoundryClient(auth=foundry_sdk.UserTokenAuth(...), hostname="example.palantirfoundry.com")
# DatasetRid | The RID of the Dataset.
dataset_rid = None
# Optional[BranchId] | The ID of the Branch.
branch_id = None
# Optional[PreviewMode]
preview = True
# Optional[TransactionRid] | The TransactionRid that contains the Schema.
transaction_rid = None
try:
api_response = client.datasets.Dataset.get_schema(
dataset_rid, branch_id=branch_id, preview=preview, transaction_rid=transaction_rid
)
print("The get_schema response:\n")
pprint(api_response)
except foundry_sdk.PalantirRPCException as e:
print("HTTP error when calling Dataset.get_schema: %s\n" % e)See README
| Status Code | Type | Description | Content Type |
|---|---|---|---|
| 200 | Optional[Any] | application/json |
[Back to top] [Back to API list] [Back to Model list] [Back to README]
Gets the content of a dataset as a table in the specified format.
This endpoint currently does not support views (virtual datasets composed of other datasets). For more information, refer to the views documentation.
| Name | Type | Description | Notes |
|---|---|---|---|
| dataset_rid | DatasetRid | The RID of the Dataset. | |
| format | TableExportFormat | The export format. Must be ARROW or CSV. |
|
| branch_id | Optional[BranchId] | The identifier (name) of the Branch. | [optional] |
| columns | Optional[List[str]] | A subset of the dataset columns to include in the result. Defaults to all columns. | [optional] |
| end_transaction_rid | Optional[TransactionRid] | The Resource Identifier (RID) of the end Transaction. | [optional] |
| row_limit | Optional[int] | A limit on the number of rows to return. Note that row ordering is non-deterministic. | [optional] |
| start_transaction_rid | Optional[TransactionRid] | The Resource Identifier (RID) of the start Transaction. | [optional] |
bytes
from foundry_sdk.v1 import FoundryClient
import foundry_sdk
from pprint import pprint
client = FoundryClient(auth=foundry_sdk.UserTokenAuth(...), hostname="example.palantirfoundry.com")
# DatasetRid | The RID of the Dataset.
dataset_rid = None
# TableExportFormat | The export format. Must be `ARROW` or `CSV`.
format = "CSV"
# Optional[BranchId] | The identifier (name) of the Branch.
branch_id = None
# Optional[List[str]] | A subset of the dataset columns to include in the result. Defaults to all columns.
columns = None
# Optional[TransactionRid] | The Resource Identifier (RID) of the end Transaction.
end_transaction_rid = None
# Optional[int] | A limit on the number of rows to return. Note that row ordering is non-deterministic.
row_limit = None
# Optional[TransactionRid] | The Resource Identifier (RID) of the start Transaction.
start_transaction_rid = None
try:
api_response = client.datasets.Dataset.read(
dataset_rid,
format=format,
branch_id=branch_id,
columns=columns,
end_transaction_rid=end_transaction_rid,
row_limit=row_limit,
start_transaction_rid=start_transaction_rid,
)
print("The read response:\n")
pprint(api_response)
except foundry_sdk.PalantirRPCException as e:
print("HTTP error when calling Dataset.read: %s\n" % e)import foundry
from foundry.models import TableExportFormat
from foundry import PalantirRPCException
foundry_client = foundry.FoundryV1Client(auth=foundry.UserTokenAuth(...), hostname="example.palantirfoundry.com")
try:
api_response = foundry_client.datasets.Dataset.read(
dataset_rid="...", format="CSV", columns=[...]
)
with open("my_table.csv", "wb") as f:
f.write(api_response)
except PalantirRPCException as e:
print("PalantirRPCException when calling DatasetsApiServiceApi -> read: %s\n" % e)Important
For this example to work, you will need to have pyarrow installed in your Python environment.
import foundry
from foundry.models import TableExportFormat
from foundry import PalantirRPCException
import pyarrow as pa
foundry_client = foundry.FoundryV1Client(auth=foundry.UserTokenAuth(...), hostname="example.palantirfoundry.com")
try:
api_response = foundry_client.datasets.Dataset.read(dataset_rid="...", format="ARROW", columns=[...])
df = pa.ipc.open_stream(api_response).read_all().to_pandas()
print(df)
except Exception as e:
print("Exception when calling DatasetsApiServiceApi -> read: %s\n" % e) id word length double boolean
0 0 A 1.0 11.878200 1
1 1 a 1.0 11.578800 0
2 2 aa 2.0 15.738500 1
3 3 aal 3.0 6.643900 0
4 4 aalii 5.0 2.017730 1
... ... ... ... ... ...
235881 235881 zythem 6.0 19.427400 1
235882 235882 Zythia 6.0 14.397100 1
235883 235883 zythum 6.0 3.385820 0
235884 235884 Zyzomys 7.0 6.208830 1
235885 235885 Zyzzogeton 10.0 0.947821 0
[235886 rows x 5 columns]
See README
| Status Code | Type | Description | Content Type |
|---|---|---|---|
| 200 | bytes | The content stream. | / |
[Back to top] [Back to API list] [Back to Model list] [Back to README]
Puts a Schema on an existing Dataset and Branch.
| Name | Type | Description | Notes |
|---|---|---|---|
| dataset_rid | DatasetRid | The RID of the Dataset on which to put the Schema. | |
| body | Any | Body of the request | |
| branch_id | Optional[BranchId] | The ID of the Branch on which to put the Schema. | [optional] |
| preview | Optional[PreviewMode] | [optional] |
None
from foundry_sdk.v1 import FoundryClient
import foundry_sdk
from pprint import pprint
client = FoundryClient(auth=foundry_sdk.UserTokenAuth(...), hostname="example.palantirfoundry.com")
# DatasetRid | The RID of the Dataset on which to put the Schema.
dataset_rid = None
# Any | Body of the request
body = None
# Optional[BranchId] | The ID of the Branch on which to put the Schema.
branch_id = None
# Optional[PreviewMode]
preview = True
try:
api_response = client.datasets.Dataset.replace_schema(
dataset_rid, body, branch_id=branch_id, preview=preview
)
print("The replace_schema response:\n")
pprint(api_response)
except foundry_sdk.PalantirRPCException as e:
print("HTTP error when calling Dataset.replace_schema: %s\n" % e)See README
| Status Code | Type | Description | Content Type |
|---|---|---|---|
| 204 | None | None |
[Back to top] [Back to API list] [Back to Model list] [Back to README]