[FEA]: to_markdown/to_markdown_by_page should differentiate by distinct document ingested #1630

### Is this a new feature, an improvement, or a change to existing functionality?

New Feature

### How would you describe the priority of this feature request

Significant improvement

### Please provide a clear description of problem this feature solves

Using [the snippet](https://github.com/randerzander/NeMo-Retriever/tree/main/nemo_retriever#ingest-a-test-pdf), if you ingest a single document, the markdown conversion makes sense.

However, if your ingestion job contains multiple documents, there is no way to tell which document each part of the returned markdown came from.

For example, if you ingest multimodal_test.pdf and an additional single-page PDF, to_markdown_by_page returns what looks like a representation of a single 4-page document.

### Describe the feature, and optionally a solution or implementation and any alternatives

Both to_markdown and to_markdown_by_page should probably include a source_filename field by which chunks are grouped.
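As a rough illustration of the grouping being requested, here is a minimal sketch. It is not the library's API: the helper name `to_markdown_by_document`, the record keys (`source_filename`, `page_number`, `markdown`), the second filename `one_page.pdf`, and the sample page text are all assumptions made up for this example, and the real result schema may differ.

```python
from collections import defaultdict


def to_markdown_by_document(records):
    """Group page-level markdown by source file, then by page number.

    `records` is a hypothetical iterable of dicts with the keys
    "source_filename", "page_number", and "markdown". These names are
    assumptions for illustration, not the actual result schema.
    """
    grouped = defaultdict(lambda: defaultdict(list))
    for rec in records:
        grouped[rec["source_filename"]][rec["page_number"]].append(rec["markdown"])

    # Join the chunks on each page and sort pages within each document.
    return {
        filename: {
            page: "\n\n".join(chunks) for page, chunks in sorted(pages.items())
        }
        for filename, pages in grouped.items()
    }


# Placeholder records: a 3-page PDF plus a single-page PDF, in mixed order.
records = [
    {"source_filename": "multimodal_test.pdf", "page_number": 2, "markdown": "page two"},
    {"source_filename": "one_page.pdf", "page_number": 1, "markdown": "only page"},
    {"source_filename": "multimodal_test.pdf", "page_number": 1, "markdown": "page one"},
    {"source_filename": "multimodal_test.pdf", "page_number": 1, "markdown": "a table"},
    {"source_filename": "multimodal_test.pdf", "page_number": 3, "markdown": "page three"},
]

result = to_markdown_by_document(records)
for filename, pages in result.items():
    print(filename, list(pages))
```

With output keyed by filename first, the two PDFs no longer read as one 4-page document, and page numbers restart per document.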

### Additional context

 
