Data Sources
List Data Sources for a Knowledge Base
Add Data Source to a Knowledge Base
Delete a Data Source from a Knowledge Base
Create Presigned URLs for Data Source File Upload
ModelsExpand Collapse
APIFileUploadDataSource { original_file_name, size_in_bytes, stored_object_key } File to upload as data source for knowledge base.
File to upload as data source for knowledge base.
The original file name
The size of the file in bytes
The object key the file was stored as
APIKnowledgeBaseDataSource { aws_data_source, bucket_name, created_at, 11 more } Data Source configuration for Knowledge Bases
Data Source configuration for Knowledge Bases
aws_data_source?: AwsDataSource { bucket_name, item_path, region } AWS S3 Data Source for Display
AWS S3 Data Source for Display
Spaces bucket name
Region of bucket
Name of storage bucket - Deprecated, moved to data_source_details
Creation date / time
dropbox_data_source?: DropboxDataSource { folder } Dropbox Data Source for Display
Dropbox Data Source for Display
file_upload_data_source?: APIFileUploadDataSource { original_file_name, size_in_bytes, stored_object_key } File to upload as data source for knowledge base.
File to upload as data source for knowledge base.
The original file name
The size of the file in bytes
The object key the file was stored as
google_drive_data_source?: GoogleDriveDataSource { folder_id, folder_name } Google Drive Data Source for Display
Google Drive Data Source for Display
Name of the selected folder if available
Path of folder or object in bucket - Deprecated, moved to data_source_details
last_datasource_indexing_job?: APIIndexedDataSource { completed_at, data_source_uuid, error_details, 11 more }
Timestamp when data source completed indexing
Uuid of the indexed data source
A detailed error description
A string code provinding a hint which part of the system experienced an error
Total count of files that have failed
Total count of files that have been indexed
Total count of files that have been indexed
Total count of files that have been removed
Total count of files that have been skipped
Timestamp when data source started indexing
status?: "DATA_SOURCE_STATUS_UNKNOWN" | "DATA_SOURCE_STATUS_IN_PROGRESS" | "DATA_SOURCE_STATUS_UPDATED" | 4 more
Total size of files in data source in bytes
Total size of files in data source in bytes that have been indexed
Total file count in the data source
IndexingJob description
IndexingJob description
Number of datasources indexed completed
Creation date / time
data_source_jobs?: Array<APIIndexedDataSource { completed_at, data_source_uuid, error_details, 11 more } >Details on Data Sources included in the Indexing Job
Details on Data Sources included in the Indexing Job
Timestamp when data source completed indexing
Uuid of the indexed data source
A detailed error description
A string code provinding a hint which part of the system experienced an error
Total count of files that have failed
Total count of files that have been indexed
Total count of files that have been indexed
Total count of files that have been removed
Total count of files that have been skipped
Timestamp when data source started indexing
status?: "DATA_SOURCE_STATUS_UNKNOWN" | "DATA_SOURCE_STATUS_IN_PROGRESS" | "DATA_SOURCE_STATUS_UPDATED" | 4 more
Total size of files in data source in bytes
Total size of files in data source in bytes that have been indexed
Total file count in the data source
Boolean value to determine if the indexing job details are available
Knowledge base id
phase?: "BATCH_JOB_PHASE_UNKNOWN" | "BATCH_JOB_PHASE_PENDING" | "BATCH_JOB_PHASE_RUNNING" | 4 more
status?: "INDEX_JOB_STATUS_UNKNOWN" | "INDEX_JOB_STATUS_PARTIAL" | "INDEX_JOB_STATUS_IN_PROGRESS" | 4 more
Number of tokens [This field is deprecated]
Number of datasources being indexed
Total Items Failed
Total Items Indexed
Total Items Removed
Total Items Skipped
Total Tokens Consumed By the Indexing Job
Last modified
Unique id
Region code - Deprecated, moved to data_source_details
Spaces Bucket Data Source
Spaces Bucket Data Source
Spaces bucket name
Region of bucket
Last modified
Unique id of knowledge base
web_crawler_data_source?: APIWebCrawlerDataSource { base_url, crawling_option, embed_media, exclude_tags } WebCrawlerDataSource
WebCrawlerDataSource
The base url to crawl.
crawling_option?: "UNKNOWN" | "SCOPED" | "PATH" | 2 moreOptions for specifying how URLs found on pages should be handled.
- UNKNOWN: Default unknown value
- SCOPED: Only include the base URL.
- PATH: Crawl the base URL and linked pages within the URL path.
- DOMAIN: Crawl the base URL and linked pages within the same domain.
- SUBDOMAINS: Crawl the base URL and linked pages for any subdomain.
Options for specifying how URLs found on pages should be handled.
- UNKNOWN: Default unknown value
- SCOPED: Only include the base URL.
- PATH: Crawl the base URL and linked pages within the URL path.
- DOMAIN: Crawl the base URL and linked pages within the same domain.
- SUBDOMAINS: Crawl the base URL and linked pages for any subdomain.
Whether to ingest and index media (images, etc.) on web pages.
Declaring which tags to exclude in web pages while webcrawling
APISpacesDataSource { bucket_name, item_path, region } Spaces Bucket Data Source
Spaces Bucket Data Source
Spaces bucket name
Region of bucket
APIWebCrawlerDataSource { base_url, crawling_option, embed_media, exclude_tags } WebCrawlerDataSource
WebCrawlerDataSource
The base url to crawl.
crawling_option?: "UNKNOWN" | "SCOPED" | "PATH" | 2 moreOptions for specifying how URLs found on pages should be handled.
- UNKNOWN: Default unknown value
- SCOPED: Only include the base URL.
- PATH: Crawl the base URL and linked pages within the URL path.
- DOMAIN: Crawl the base URL and linked pages within the same domain.
- SUBDOMAINS: Crawl the base URL and linked pages for any subdomain.
Options for specifying how URLs found on pages should be handled.
- UNKNOWN: Default unknown value
- SCOPED: Only include the base URL.
- PATH: Crawl the base URL and linked pages within the URL path.
- DOMAIN: Crawl the base URL and linked pages within the same domain.
- SUBDOMAINS: Crawl the base URL and linked pages for any subdomain.
Whether to ingest and index media (images, etc.) on web pages.
Declaring which tags to exclude in web pages while webcrawling
AwsDataSource { bucket_name, item_path, key_id, 2 more } AWS S3 Data Source
AWS S3 Data Source
Spaces bucket name
The AWS Key ID
Region of bucket
The AWS Secret Key