Method: corpora.documents.chunks.create
Creates a Chunk.
Endpoint
POST https://generativelanguage.googleapis.com/v1beta/{parent=corpora/*/documents/*}/chunks
The URL uses gRPC Transcoding syntax.
Path parameters
parent
string
Required. The name of the Document where this Chunk will be created. Example: corpora/my-corpus-123/documents/the-doc-abc
It takes the form corpora/{corpora}/documents/{document}.
Request body
The request body contains an instance of Chunk.
name
string
Immutable. Identifier. The Chunk resource name. The ID (name excluding the "corpora/*/documents/*/chunks/" prefix) can contain up to 40 characters that are lowercase alphanumeric or dashes (-). The ID cannot start or end with a dash. If the name is empty on create, a random 12-character unique ID will be generated. Example: corpora/{corpus_id}/documents/{document_id}/chunks/123a456b789c
Required. The content for the Chunk, such as the text string. The maximum number of tokens per chunk is 2043.
Optional. User-provided custom metadata stored as key-value pairs. The maximum number of CustomMetadata per chunk is 20.
Response body
If successful, the response body contains a newly created instance of Chunk.
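As a rough illustration of the rules above, the following sketch validates a Chunk ID and assembles a chunks.create request without sending it. The helper names (is_valid_chunk_id, build_create_chunk_request) are hypothetical and not part of any official client; authentication is left out, and the body shape follows the "name" and "data"/"stringValue" fields described in this reference.

```python
import json
import re

BASE_URL = "https://generativelanguage.googleapis.com/v1beta"

# Up to 40 lowercase alphanumerics or dashes, never starting or ending with a dash.
_CHUNK_ID = re.compile(r"[a-z0-9](?:[a-z0-9-]{0,38}[a-z0-9])?")


def is_valid_chunk_id(chunk_id: str) -> bool:
    """Check a Chunk ID against the documented naming rules."""
    return _CHUNK_ID.fullmatch(chunk_id) is not None


def build_create_chunk_request(parent: str, text: str, chunk_id: str = ""):
    """Return (method, url, json_body) for chunks.create; nothing is sent."""
    body = {"data": {"stringValue": text}}
    if chunk_id:
        if not is_valid_chunk_id(chunk_id):
            raise ValueError(f"invalid chunk ID: {chunk_id!r}")
        body["name"] = f"{parent}/chunks/{chunk_id}"
    # With no name in the body, the service generates a random 12-character ID.
    return "POST", f"{BASE_URL}/{parent}/chunks", json.dumps(body)
```

The returned tuple can be handed to whichever HTTP client is in use, together with the credentials the API requires.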
Method: corpora.documents.chunks.list
Lists all Chunks in a Document.
Endpoint
GET https://generativelanguage.googleapis.com/v1beta/{parent=corpora/*/documents/*}/chunks
The URL uses gRPC Transcoding syntax.
Path parameters
parent
string
Required. The name of the Document containing Chunks. Example: corpora/my-corpus-123/documents/the-doc-abc
It takes the form corpora/{corpora}/documents/{document}.
Query parameters
pageSize
integer
Optional. The maximum number of Chunks to return (per page). The service may return fewer Chunks.
If unspecified, at most 10 Chunks will be returned. The maximum size limit is 100 Chunks per page.
pageToken
string
Optional. A page token, received from a previous chunks.list call.
Provide the nextPageToken returned in the response as an argument to the next request to retrieve the next page.
When paginating, all other parameters provided to chunks.list must match the call that provided the page token.
Request body
The request body must be empty.
Response body
Response from chunks.list containing a paginated list of Chunks. The Chunks are sorted by ascending chunk.create_time.
If successful, the response body contains data with the following structure:
The returned Chunks.
nextPageToken
string
A token, which can be sent as pageToken to retrieve the next page. If this field is omitted, there are no more pages.
JSON representation |
---|
{
"chunks": [
{
object ( |
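The pagination contract for chunks.list can be sketched as a generator that keeps requesting pages until nextPageToken is omitted. Here fetch_page is a hypothetical callable supplied by the caller: it is assumed to issue the GET request with the given pageSize and pageToken (keeping all other parameters identical between calls) and to return the decoded JSON response.

```python
from typing import Callable, Iterator, Optional


def iter_chunks(
    fetch_page: Callable[[int, Optional[str]], dict],
    page_size: int = 10,
) -> Iterator[dict]:
    """Yield every Chunk by following nextPageToken until it is omitted."""
    if page_size > 100:
        raise ValueError("pageSize cannot exceed 100 Chunks per page")
    token = None
    while True:
        page = fetch_page(page_size, token)
        # A page may hold fewer Chunks than requested, or none at all.
        yield from page.get("chunks", [])
        token = page.get("nextPageToken")
        if not token:
            return
```

Because the function is a generator, the first request is only made once iteration starts, and a caller can stop early without fetching the remaining pages.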
Method: corpora.documents.chunks.get
Gets information about a specific Chunk.
Endpoint
GET https://generativelanguage.googleapis.com/v1beta/{name=corpora/*/documents/*/chunks/*}
The URL uses gRPC Transcoding syntax.
Path parameters
name
string
Required. The name of the Chunk to retrieve. Example: corpora/my-corpus-123/documents/the-doc-abc/chunks/some-chunk
It takes the form corpora/{corpora}/documents/{document}/chunks/{chunk}.
Request body
The request body must be empty.
Response body
If successful, the response body contains an instance of Chunk.
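Since chunks.get is addressed purely by resource name, it can help to split a name into its parts before building the URL. The sketch below is one possible approach; ChunkName, parse_chunk_name and get_chunk_url are hypothetical helpers, and the pattern only checks the corpora/{corpora}/documents/{document}/chunks/{chunk} shape, not the character rules for each ID.

```python
import re
from typing import NamedTuple

BASE_URL = "https://generativelanguage.googleapis.com/v1beta"
_CHUNK_NAME = re.compile(r"corpora/([^/]+)/documents/([^/]+)/chunks/([^/]+)")


class ChunkName(NamedTuple):
    corpus: str
    document: str
    chunk: str

    @property
    def parent(self) -> str:
        """Resource name of the Document that owns the Chunk."""
        return f"corpora/{self.corpus}/documents/{self.document}"


def parse_chunk_name(name: str) -> ChunkName:
    """Split a Chunk resource name into corpus, document and chunk IDs."""
    match = _CHUNK_NAME.fullmatch(name)
    if match is None:
        raise ValueError(f"not a Chunk resource name: {name!r}")
    return ChunkName(*match.groups())


def get_chunk_url(name: str) -> str:
    """Build the chunks.get URL after checking the name's shape."""
    parse_chunk_name(name)
    return f"{BASE_URL}/{name}"
```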
Method: corpora.documents.chunks.patch
Updates a Chunk.
Endpoint
PATCH https://generativelanguage.googleapis.com/v1beta/{chunk.name=corpora/*/documents/*/chunks/*}
The URL uses gRPC Transcoding syntax.
Path parameters
chunk.name
string
Immutable. Identifier. The Chunk resource name. The ID (name excluding the "corpora/*/documents/*/chunks/" prefix) can contain up to 40 characters that are lowercase alphanumeric or dashes (-). The ID cannot start or end with a dash. If the name is empty on create, a random 12-character unique ID will be generated. Example: corpora/{corpus_id}/documents/{document_id}/chunks/123a456b789c
It takes the form corpora/{corpora}/documents/{document}/chunks/{chunk}.
Query parameters
Required. The list of fields to update. Currently, this only supports updating customMetadata and data.
This is a comma-separated list of fully qualified names of fields. Example: "user.displayName,photo".
Request body
The request body contains an instance of Chunk.
Required. The content for the Chunk, such as the text string. The maximum number of tokens per chunk is 2043.
Optional. User-provided custom metadata stored as key-value pairs. The maximum number of CustomMetadata per chunk is 20.
Response body
If successful, the response body contains an instance of Chunk.
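A minimal sketch of assembling a chunks.patch request follows. It assumes the field list is passed as a query parameter named updateMask (the parameter name is not shown in this section, so treat it as an assumption), and build_patch_chunk_request is a hypothetical helper that only builds the request; it does not send it.

```python
import json
from urllib.parse import urlencode

BASE_URL = "https://generativelanguage.googleapis.com/v1beta"
UPDATABLE_FIELDS = ("customMetadata", "data")


def build_patch_chunk_request(chunk: dict, fields: list):
    """Return (method, url, json_body) for chunks.patch; nothing is sent."""
    unsupported = [f for f in fields if f not in UPDATABLE_FIELDS]
    if unsupported:
        raise ValueError(f"fields cannot be updated: {unsupported}")
    # Only the masked fields go in the body; the name travels in the URL path.
    body = {field: chunk[field] for field in fields}
    # "updateMask" is an assumed parameter name for the comma-separated field list.
    query = urlencode({"updateMask": ",".join(fields)})
    return "PATCH", f"{BASE_URL}/{chunk['name']}?{query}", json.dumps(body)
```

Rejecting unsupported fields locally avoids a round trip for a request the service would not accept.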
Method: corpora.documents.chunks.delete
Deletes a Chunk.
Endpoint
DELETE https://generativelanguage.googleapis.com/v1beta/{name=corpora/*/documents/*/chunks/*}
The URL uses gRPC Transcoding syntax.
Path parameters
name
string
Required. The resource name of the Chunk to delete. Example: corpora/my-corpus-123/documents/the-doc-abc/chunks/some-chunk
It takes the form corpora/{corpora}/documents/{document}/chunks/{chunk}.
Request body
The request body must be empty.
Response body
If successful, the response body is empty.
Method: corpora.documents.chunks.batchCreate
Batch create Chunks.
Endpoint
POST https://generativelanguage.googleapis.com/v1beta/{parent=corpora/*/documents/*}/chunks:batchCreate
The URL uses gRPC Transcoding syntax.
Path parameters
parent
string
Optional. The name of the Document where this batch of Chunks will be created. The parent field in every CreateChunkRequest must match this value. Example: corpora/my-corpus-123/documents/the-doc-abc
It takes the form corpora/{corpora}/documents/{document}.
Request body
The request body contains data with the following structure:
Required. The request messages specifying the Chunks to create. A maximum of 100 Chunks can be created in a batch.
Response body
Response from chunks.batchCreate containing a list of created Chunks.
If successful, the response body contains data with the following structure:
Chunks created.
JSON representation |
---|
{
"chunks": [
{
object ( |
CreateChunkRequest
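Because a batch can hold at most 100 Chunks, a longer list of texts has to be split across several chunks:batchCreate calls. The sketch below shows one way to do that. The "requests" and "chunk" field names are assumptions (only the parent field of CreateChunkRequest is named in this section), and build_batch_create_bodies is a hypothetical helper.

```python
from typing import Iterator, List

MAX_BATCH = 100  # documented limit on Chunks created per batch


def build_batch_create_bodies(parent: str, texts: List[str]) -> Iterator[dict]:
    """Yield one batchCreate request body per group of up to 100 texts."""
    for start in range(0, len(texts), MAX_BATCH):
        yield {
            # "requests" and "chunk" are assumed field names.
            "requests": [
                # Every CreateChunkRequest repeats the same parent as the URL path.
                {"parent": parent, "chunk": {"data": {"stringValue": text}}}
                for text in texts[start:start + MAX_BATCH]
            ]
        }
```

Each yielded body would be posted to the chunks:batchCreate endpoint of the same parent Document.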
Method: corpora.documents.chunks.batchUpdate
Batch update Chunks.
Endpoint
POST https://generativelanguage.googleapis.com/v1beta/{parent=corpora/*/documents/*}/chunks:batchUpdate
The URL uses gRPC Transcoding syntax.
Path parameters
parent
string
Optional. The name of the Document containing the Chunks to update. The parent field in every UpdateChunkRequest must match this value. Example: corpora/my-corpus-123/documents/the-doc-abc
It takes the form corpora/{corpora}/documents/{document}.
Request body
The request body contains data with the following structure:
Required. The request messages specifying the Chunks to update. A maximum of 100 Chunks can be updated in a batch.
Response body
Response from chunks.batchUpdate containing a list of updated Chunks.
If successful, the response body contains data with the following structure:
Chunks updated.
JSON representation |
---|
{
"chunks": [
{
object ( |
UpdateChunkRequest
Request to update a Chunk.
Required. The Chunk to update.
Required. The list of fields to update. Currently, this only supports updating customMetadata and data.
This is a comma-separated list of fully qualified names of fields. Example: "user.displayName,photo".
JSON representation |
---|
{
"chunk": {
object ( |
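To show how UpdateChunkRequest messages might be combined into a single chunks:batchUpdate body, here is a small sketch that replaces the text of several Chunks. The "requests" and "updateMask" field names are assumptions not confirmed by this section, and build_batch_update_body is a hypothetical helper; the check that each Chunk belongs to the parent Document is done locally on the resource name.

```python
from typing import Dict

MAX_BATCH = 100  # documented limit on Chunks updated per batch


def build_batch_update_body(parent: str, new_text_by_name: Dict[str, str]) -> dict:
    """Build a batchUpdate body that replaces the data of each named Chunk."""
    if len(new_text_by_name) > MAX_BATCH:
        raise ValueError("at most 100 Chunks can be updated in a batch")
    prefix = parent + "/chunks/"
    requests = []
    for name, text in new_text_by_name.items():
        if not name.startswith(prefix):
            raise ValueError(f"{name!r} is not a Chunk of {parent!r}")
        requests.append({
            "chunk": {"name": name, "data": {"stringValue": text}},
            # Assumed field name; only "data" is being changed here.
            "updateMask": "data",
        })
    return {"requests": requests}
```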
Method: corpora.documents.chunks.batchDelete
Batch delete Chunks.
Endpoint
POST https://generativelanguage.googleapis.com/v1beta/{parent=corpora/*/documents/*}/chunks:batchDelete
The URL uses gRPC Transcoding syntax.
Path parameters
parent
string
Optional. The name of the Document containing the Chunks to delete. The parent field in every DeleteChunkRequest must match this value. Example: corpora/my-corpus-123/documents/the-doc-abc
It takes the form corpora/{corpora}/documents/{document}.
Request body
The request body contains data with the following structure:
Required. The request messages specifying the Chunks to delete.
Response body
If successful, the response body is empty.
DeleteChunkRequest
Request to delete a Chunk.
name
string
Required. The resource name of the Chunk to delete. Example: corpora/my-corpus-123/documents/the-doc-abc/chunks/some-chunk
JSON representation |
---|
{ "name": string } |
REST Resource: corpora.documents.chunks
Resource: Chunk
A Chunk is a subpart of a Document that is treated as an independent unit for the purposes of vector representation and storage. A Corpus can have a maximum of 1 million Chunks.
name
string
Immutable. Identifier. The Chunk resource name. The ID (name excluding the "corpora/*/documents/*/chunks/" prefix) can contain up to 40 characters that are lowercase alphanumeric or dashes (-). The ID cannot start or end with a dash. If the name is empty on create, a random 12-character unique ID will be generated. Example: corpora/{corpus_id}/documents/{document_id}/chunks/123a456b789c
Required. The content for the Chunk, such as the text string. The maximum number of tokens per chunk is 2043.
Optional. User-provided custom metadata stored as key-value pairs. The maximum number of CustomMetadata per chunk is 20.
Output only. The Timestamp of when the Chunk was created.
A timestamp in RFC3339 UTC "Zulu" format, with nanosecond resolution and up to nine fractional digits. Examples: "2014-10-02T15:01:23Z" and "2014-10-02T15:01:23.045123456Z".
Output only. The Timestamp of when the Chunk was last updated.
A timestamp in RFC3339 UTC "Zulu" format, with nanosecond resolution and up to nine fractional digits. Examples: "2014-10-02T15:01:23Z" and "2014-10-02T15:01:23.045123456Z".
Output only. Current state of the Chunk.
JSON representation |
---|
{ "name": string, "data": { object ( |
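The creation and update timestamps on a Chunk use RFC3339 UTC "Zulu" format with up to nine fractional digits, which is finer than Python's datetime can hold. The sketch below is one way to parse them; parse_timestamp is a hypothetical helper, and it deliberately truncates anything beyond microseconds rather than rounding.

```python
import re
from datetime import datetime, timezone

_RFC3339_ZULU = re.compile(
    r"(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2})(?:\.(\d{1,9}))?Z"
)


def parse_timestamp(value: str) -> datetime:
    """Parse an RFC3339 "Zulu" timestamp into a timezone-aware UTC datetime."""
    match = _RFC3339_ZULU.fullmatch(value)
    if match is None:
        raise ValueError(f"not an RFC3339 Zulu timestamp: {value!r}")
    seconds, fraction = match.groups()
    parsed = datetime.strptime(seconds, "%Y-%m-%dT%H:%M:%S")
    # datetime stores microseconds, so digits past the sixth are dropped.
    micros = int((fraction or "").ljust(6, "0")[:6])
    return parsed.replace(microsecond=micros, tzinfo=timezone.utc)
```

If full nanosecond precision matters, the fractional part would need to be kept separately, since it cannot be represented in a datetime value.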
ChunkData
Extracted data that represents the Chunk content.
data
Union type
data can be only one of the following:
stringValue
string
The Chunk content as a string. The maximum number of tokens per chunk is 2043.
JSON representation |
---|
{ // data "stringValue": string // Union type } |
State
States for the lifecycle of a Chunk.
Enums | |
---|---|
STATE_UNSPECIFIED | The default value. This value is used if the state is omitted. |
STATE_PENDING_PROCESSING | Chunk is being processed (embedding and vector storage). |
STATE_ACTIVE | Chunk is processed and available for querying. |
STATE_FAILED | Chunk failed processing. |
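Since a Chunk only becomes available for querying once it reaches STATE_ACTIVE, a caller may want to poll after creating it. The following is a minimal sketch of such a loop. It assumes the state is exposed in a field named "state" (the field name is not shown in this section), and get_chunk is a hypothetical callable that performs chunks.get and returns the decoded Chunk.

```python
import time
from typing import Callable


def wait_until_active(
    get_chunk: Callable[[], dict],
    attempts: int = 10,
    delay_s: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Poll a Chunk until it is STATE_ACTIVE, failing fast on STATE_FAILED."""
    for attempt in range(attempts):
        chunk = get_chunk()
        # "state" is an assumed field name; a missing value means unspecified.
        state = chunk.get("state", "STATE_UNSPECIFIED")
        if state == "STATE_ACTIVE":
            return chunk
        if state == "STATE_FAILED":
            raise RuntimeError("Chunk failed processing")
        if attempt < attempts - 1:
            sleep(delay_s)
    raise TimeoutError(f"Chunk not active after {attempts} attempts")
```

The sleep function is injectable so the loop can be exercised without real delays; the attempt count and delay are arbitrary defaults, not values suggested by the API.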