
PR3: implement document processing pipeline #4

Merged: NeurArk merged 1 commit into main from neurark/réaliser-pr3-selon-la-documentation on May 26, 2025


NeurArk (owner) commented on May 26, 2025

Summary

  • add document extraction modules and processor
  • implement text chunking and cleaning utilities
  • update TODO for PR3 completion
  • add unit tests for document processing

Testing

  • pytest -q

NeurArk merged commit 3cab3eb into main on May 26, 2025
1 check passed
NeurArk deleted the neurark/réaliser-pr3-selon-la-documentation branch on May 26, 2025 15:41