Documents
ragbits.document_search.documents.document.Document
Bases: BaseModel
An object representing a document which is downloaded and stored locally.
from_document_meta
classmethod
from_document_meta(document_meta: DocumentMeta, local_path: Path) -> Document
Create a document from document metadata. The concrete object returned depends on the document type.
PARAMETER | DESCRIPTION |
---|---|
document_meta | The document metadata. TYPE: DocumentMeta |
local_path | The local path to the document. TYPE: Path |

RETURNS | DESCRIPTION |
---|---|
Document | The document. |
Source code in packages/ragbits-document-search/src/ragbits/document_search/documents/document.py
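The type-based dispatch described for from_document_meta can be illustrated with a small standalone sketch. It does not import ragbits and is not the library's implementation: SketchType, SketchMeta, SketchDocument, SketchTextDocument and the _BY_TYPE mapping are hypothetical stand-ins, and the choice of a lookup table is an assumption made for illustration.

```python
from dataclasses import dataclass
from enum import Enum
from pathlib import Path


class SketchType(str, Enum):  # hypothetical stand-in for DocumentType
    TXT = "txt"
    PDF = "pdf"


@dataclass
class SketchMeta:  # hypothetical stand-in for DocumentMeta
    document_type: SketchType


@dataclass
class SketchDocument:  # hypothetical stand-in for Document
    local_path: Path
    metadata: SketchMeta

    @classmethod
    def from_document_meta(cls, document_meta: SketchMeta, local_path: Path) -> "SketchDocument":
        # Pick a subclass based on the document type; fall back to the base class.
        target = _BY_TYPE.get(document_meta.document_type, SketchDocument)
        return target(local_path=local_path, metadata=document_meta)


@dataclass
class SketchTextDocument(SketchDocument):  # hypothetical stand-in for TextDocument
    @property
    def content(self) -> str:
        # Read the locally stored file as text.
        return self.local_path.read_text()


# Assumed mapping from document type to the class that handles it.
_BY_TYPE = {SketchType.TXT: SketchTextDocument}

text_doc = SketchDocument.from_document_meta(SketchMeta(SketchType.TXT), Path("notes.txt"))
other_doc = SketchDocument.from_document_meta(SketchMeta(SketchType.PDF), Path("paper.pdf"))
print(type(text_doc).__name__)   # SketchTextDocument
print(type(other_doc).__name__)  # SketchDocument
```

In this sketch the caller always invokes the base-class factory and receives whichever subclass matches the type, which is why the return annotation is the base class.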
ragbits.document_search.documents.document.TextDocument
Bases: Document
An object representing a text document.
content
property
Get the content of the document.
RETURNS | DESCRIPTION |
---|---|
str | The content of the document. |
from_document_meta
classmethod
from_document_meta(document_meta: DocumentMeta, local_path: Path) -> Document
Create a document from document metadata. The concrete object returned depends on the document type.
PARAMETER | DESCRIPTION |
---|---|
document_meta | The document metadata. TYPE: DocumentMeta |
local_path | The local path to the document. TYPE: Path |

RETURNS | DESCRIPTION |
---|---|
Document | The document. |
Source code in packages/ragbits-document-search/src/ragbits/document_search/documents/document.py
ragbits.document_search.documents.document.DocumentMeta
Bases: BaseModel
An object representing document metadata.
fetch
async
fetch() -> Document
Fetch the document from its source (potentially remote) and create an object to interface with it. The concrete object returned depends on the document type.
RETURNS | DESCRIPTION |
---|---|
Document | The document. |
Source code in packages/ragbits-document-search/src/ragbits/document_search/documents/document.py
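The two steps described for fetch (obtain a local copy from the source, then wrap it in a document object) can be sketched as follows. This is a standalone illustration under stated assumptions, not the ragbits code: LiteralSource, FetchedText and MetaSketch are hypothetical classes, and writing the content to a temporary directory is an assumption about how a local copy might be produced.

```python
import asyncio
import tempfile
from dataclasses import dataclass
from pathlib import Path


@dataclass
class LiteralSource:  # hypothetical source that holds its text in memory
    text: str

    async def fetch(self) -> Path:
        # Materialise the content as a local file and return its path.
        path = Path(tempfile.mkdtemp()) / "literal.txt"
        path.write_text(self.text)
        return path


@dataclass
class FetchedText:  # hypothetical stand-in for a text document
    local_path: Path

    @property
    def content(self) -> str:
        return self.local_path.read_text()


@dataclass
class MetaSketch:  # hypothetical stand-in for DocumentMeta
    source: LiteralSource

    async def fetch(self) -> FetchedText:
        # Step 1: obtain a local copy. Step 2: wrap it in a document object.
        local_path = await self.source.fetch()
        return FetchedText(local_path=local_path)


document = asyncio.run(MetaSketch(LiteralSource("hello")).fetch())
print(document.content)  # hello
```

Because fetch is a coroutine, it has to be awaited inside async code or driven with asyncio.run as shown here.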
create_text_document_from_literal
classmethod
create_text_document_from_literal(content: str) -> DocumentMeta
Create a text document from literal content. This method is deprecated; use from_literal() instead.
PARAMETER | DESCRIPTION |
---|---|
content | The content of the document. TYPE: str |

RETURNS | DESCRIPTION |
---|---|
DocumentMeta | The document metadata. |
Source code in packages/ragbits-document-search/src/ragbits/document_search/documents/document.py
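A deprecated alias that forwards to its replacement is commonly written as shown below. This is a generic sketch of the pattern and not the library's source: MetaWithAlias is a hypothetical class, and whether ragbits actually emits a DeprecationWarning from create_text_document_from_literal is an assumption.

```python
import warnings
from dataclasses import dataclass


@dataclass
class MetaWithAlias:  # hypothetical stand-in for DocumentMeta
    content: str

    @classmethod
    def from_literal(cls, content: str) -> "MetaWithAlias":
        return cls(content=content)

    @classmethod
    def create_text_document_from_literal(cls, content: str) -> "MetaWithAlias":
        # Warn the caller (assumed behaviour), then delegate to the replacement.
        warnings.warn(
            "create_text_document_from_literal() is deprecated, use from_literal() instead",
            DeprecationWarning,
            stacklevel=2,
        )
        return cls.from_literal(content)


# Record warnings so the deprecation can be inspected.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    old = MetaWithAlias.create_text_document_from_literal("hi")

new = MetaWithAlias.from_literal("hi")
print(old == new)  # True
```

Both calls yield equal objects in this sketch, so migrating amounts to renaming the call.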
from_literal
classmethod
from_literal(content: str) -> DocumentMeta
Create a text document from literal content.
PARAMETER | DESCRIPTION |
---|---|
content | The content of the document. TYPE: str |

RETURNS | DESCRIPTION |
---|---|
DocumentMeta | The document metadata. |
Source code in packages/ragbits-document-search/src/ragbits/document_search/documents/document.py
from_local_path
classmethod
from_local_path(local_path: Path) -> DocumentMeta
Create document metadata from a local path.
PARAMETER | DESCRIPTION |
---|---|
local_path | The local path to the document. TYPE: Path |

RETURNS | DESCRIPTION |
---|---|
DocumentMeta | The document metadata. |
Source code in packages/ragbits-document-search/src/ragbits/document_search/documents/document.py
from_source
async
classmethod
from_source(source: Source) -> DocumentMeta
Create document metadata from a source.
PARAMETER | DESCRIPTION |
---|---|
source | The source from which the document is fetched. TYPE: Source |

RETURNS | DESCRIPTION |
---|---|
DocumentMeta | The document metadata. |
Source code in packages/ragbits-document-search/src/ragbits/document_search/documents/document.py
ragbits.document_search.documents.document.DocumentType
Bases: str, Enum
Document types that can be parsed.
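Deriving from both str and Enum means each member is also a plain string. The sketch below shows what that implies in practice. ExampleType and its two members are hypothetical and are not taken from the library's actual list of document types.

```python
from enum import Enum


class ExampleType(str, Enum):  # hypothetical members, for illustration only
    MD = "md"
    TXT = "txt"


# Members compare equal to plain strings.
is_equal = ExampleType.MD == "md"
# A member can be looked up from its string value.
looked_up = ExampleType("txt")
# .value gives the underlying string, e.g. for serialisation.
raw = ExampleType.TXT.value

print(is_equal, looked_up is ExampleType.TXT, raw)  # True True txt
```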