feat(oss): split MinIO into three buckets and route document uploads to the RAW bucket (LINK-215) #140
Merged
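Three-bucket model: tolink-public (public) / tolink-rag-raw (source files) / tolink-rag-docs (parsing outputs).
- OssSavePlaceEnum: add the RAW value; the Javadoc describes the responsibilities of the three buckets
- OssProperties.Minio: add the rawBucketName field (getter/setter)
- MinioFileService: resolveBucketName gains a RAW branch; returnsObjectKeyOnly now also covers RAW; validateBucketNames now checks all three bucket names for duplicates
- OssUploadRuleRegistry: the "document" rule changes from PRIVATE to RAW
- DocumentUploadAsyncExecutor.runUpload: the upload target changes to RAW
- DocumentFileServiceImpl.resolveTargetRecord: both getBucketName calls change to RAW
- DocumentParseTaskServiceImpl.MD_BUCKET: corrected to "tolink-rag-docs" (the previous value "rag-md" was wrong)
- MinioFileServiceTest: update the error-message assertion and add a RAW bucket resolution assertion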
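The MinioFileService changes listed above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the class name `BucketResolutionSketch`, the nested `SavePlace` enum (standing in for OssSavePlaceEnum), the constructor, and the exception types and messages are all hypothetical, and mapping PRIVATE to tolink-rag-docs is an assumption made only for the example.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical stand-in for MinioFileService; names and messages are assumptions.
class BucketResolutionSketch {
    // Stand-in for OssSavePlaceEnum with the new RAW value.
    enum SavePlace { PUBLIC, PRIVATE, RAW }

    private final String publicBucket;
    private final String privateBucket;
    private final String rawBucket;

    BucketResolutionSketch(String publicBucket, String privateBucket, String rawBucket) {
        this.publicBucket = publicBucket;
        this.privateBucket = privateBucket;
        this.rawBucket = rawBucket;
        validateBucketNames();
    }

    // Maps a save place to its configured bucket name, including the RAW branch.
    String resolveBucketName(SavePlace place) {
        switch (place) {
            case PUBLIC:
                return publicBucket;
            case PRIVATE:
                return privateBucket;
            case RAW:
                return rawBucket;
            default:
                throw new IllegalArgumentException("Unsupported save place: " + place);
        }
    }

    // PRIVATE and RAW uploads return only the object key, never a public URL.
    boolean returnsObjectKeyOnly(SavePlace place) {
        return place == SavePlace.PRIVATE || place == SavePlace.RAW;
    }

    // Rejects a configuration in which any two of the three bucket names coincide.
    private void validateBucketNames() {
        Set<String> seen = new HashSet<>();
        for (String name : new String[] {publicBucket, privateBucket, rawBucket}) {
            if (!seen.add(name)) {
                throw new IllegalStateException("Duplicate bucket name: " + name);
            }
        }
    }
}
```

Checking all three names in one pass means a misconfiguration that points RAW at an existing bucket fails at startup rather than silently mixing source files with other objects.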
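PrivateFileResolver.getPrivateFile hard-codes the PRIVATE bucket, but original document files are now stored in the RAW bucket, so openOriginalFile could not find the object when downloading.
- PrivateFileResolver: add an overload that takes a place parameter; the original method without that parameter delegates to it
- DocumentFileServiceImpl.openOriginalFile: now passes OssSavePlaceEnum.RAW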
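The overload-and-delegate fix described above might look like the sketch below. It is an illustration under stated assumptions: the real PrivateFileResolver signatures are not shown in this PR page, so the parameter and return types, the `SavePlace` enum (standing in for OssSavePlaceEnum), and the in-memory map that replaces MinIO are all hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for PrivateFileResolver, backed by an in-memory map instead of MinIO.
class PrivateFileResolverSketch {
    // Stand-in for OssSavePlaceEnum.
    enum SavePlace { PUBLIC, PRIVATE, RAW }

    // place -> (object key -> content); simulates one bucket per save place.
    private final Map<SavePlace, Map<String, String>> store = new HashMap<>();

    // Test helper that simulates an upload to the bucket for the given place.
    void put(SavePlace place, String objectKey, String content) {
        store.computeIfAbsent(place, p -> new HashMap<>()).put(objectKey, content);
    }

    // Original entry point: keeps its old behavior by delegating with PRIVATE.
    String getPrivateFile(String objectKey) {
        return getPrivateFile(objectKey, SavePlace.PRIVATE);
    }

    // New overload: the caller chooses which bucket to read from.
    String getPrivateFile(String objectKey, SavePlace place) {
        Map<String, String> bucket = store.get(place);
        if (bucket == null || !bucket.containsKey(objectKey)) {
            throw new IllegalStateException("Object not found: " + objectKey + " in " + place);
        }
        return bucket.get(objectKey);
    }
}
```

Existing callers keep compiling and still read from PRIVATE, while a caller such as openOriginalFile can pass RAW explicitly.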
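Background
LINK-215: user-uploaded source files (PDF, DOCX, etc.) and Python parsing outputs (Markdown, images) are stored together in a single MinIO bucket, tolink-rag-docs, which makes permission isolation, lifecycle management, and data traceability difficult.

Changes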
Three-bucket model (aligned with LinkRag on the Python side)
- tolink-rag-raw: source files
- tolink-rag-docs: parsing outputs
- tolink-public: public files

File changes
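- OssSavePlaceEnum: add the RAW enum value; the Javadoc describes the responsibilities of the three buckets
- OssProperties.Minio: add the rawBucketName field (getter/setter)
- MinioFileService:
  - resolveBucketName() gains a RAW branch
  - returnsObjectKeyOnly() now also covers RAW (no public URL is generated)
  - validateBucketNames() now checks the three bucket names for duplicates
- OssUploadRuleRegistry: the "document" rule changes from PRIVATE to RAW
- DocumentUploadAsyncExecutor: the document upload target changes from PRIVATE to RAW
- DocumentFileServiceImpl: both getBucketName(PRIVATE) calls change to getBucketName(RAW)
- DocumentParseTaskServiceImpl: the MD_BUCKET constant is corrected from the wrong value "rag-md" to "tolink-rag-docs"
- application-dev.yml (gitignored, local configuration): raw-bucket-name: tolink-rag-raw has been added

Tests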
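- MinioFileServiceTest: update the duplicate-bucket-name error-message assertion; add a RAW bucket name resolution assertion
- DocumentParseTaskServiceImplTest: no changes needed (on the retry path, md_bucket comes from historical DocumentParsedLog data and does not depend on the constant)

Related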