RemoveSegID / SegIDClean & Reflow Instructions — ATR Last-Mile Cleanup Utility v1.0
Batch 16 Modernization Date: 28 April 2026
Status: Live operational instruction section

PURPOSE
Use this section when cleaning text exported from ATR translation/review prompts that include SegID headers, PARA markers, Clean Copy banners, QA headings, or citation crumbs.

This is a last-mile cleanup workflow. It must not be used to rewrite, translate, summarize, or alter the meaning of the translated text.

LEGACY PRESERVED QUICK WORKFLOW
The older workflow instructed users to:

1. Copy the reviewed Clean Copy output.
2. Paste it into a Notepad file.
3. Save it as a UTF-8 .txt file.
4. Unzip and run RemoveSegID.exe from RemoveSegID.zip.
5. Choose Mode 3 when using that older tool.
6. Drag and drop the .txt file into the console window instead of manually typing the path.
7. Press Enter.
8. Use the cleaned output file created next to the source file, such as filename_cleaned.txt.

CURRENT SEGIDCLEAN & REFLOW WORKFLOW
The newer SegIDClean & Reflow utility is preferred when available.

Expected function:

- removes SegID headers/prefixes, with or without trailing dot or colon;
- strips “Clean Copy” banners and QA/report headings;
- deletes inline citation crumbs such as [ISO][1] or Wikipedia+2 when they are artifacts, not source text;
- preserves paragraph breaks using PARA/SegID markers;
- collapses extra blank lines;
- writes a cleaned file next to the source, such as draft.cleaned.txt;
- if PARA markers are present, emits a second pass with original paragraphing reconstituted as continuous prose, such as draft.cleaned.reflowed.txt.

SAFETY BOUNDARY
The cleanup utility edits exported text only. It must never be treated as an authority over the source translation.

Do not use cleanup to:

- fix mistranslations;
- remove difficult content;
- merge sections that should remain separate;
- erase original paragraphing without a reflow basis;
- delete quotations or source labels;
- remove footnotes that are real article content;
- remove bracketed translator notes such as 【译按：…】 when they are intended content;
- remove original-script quotations;
- remove intentional stage labels, speaker labels, or headings.

BEFORE CLEANUP
Confirm:

1. The translation/review output is complete.
2. The Clean Copy region is the intended region to clean.
3. The file is saved as UTF-8 text.
4. The file extension is .txt unless the tool explicitly supports another format.
5. You have a copy of the pre-cleaned file.
6. PARA markers, if present, are consistent.
7. SegID markers are not part of meaningful source text.

AFTER CLEANUP QA
Open the cleaned file and check:

- title is present;
- first paragraph is present;
- middle section is present;
- final paragraph is present;
- no section was silently removed;
- paragraphs are not collapsed into one unreadable block;
- poems/verses/lists did not lose intentional lineation;
- quotes and source labels remain;
- diacritics remain intact;
- Chinese characters remain intact;
- links remain intact if present;
- no QA-report headings remain unless intentionally retained;
- no SegID/PARA markers remain unless intentionally retained.

WHEN TO USE THE REFLOWED OUTPUT
Use the .reflowed output when:

- PARA markers were used correctly;
- you want continuous prose with original paragraphing restored;
- the output will be pasted into a normal article, book, or blog post.

Do not use the reflowed output without checking when:

- the text contains verse, gāthās, poetry, transcript lineation, tables, code, or prompt bodies;
- line breaks carry meaning;
- the source uses intentionally short standalone lines;
- the cleanup result visually changes structure.

WHEN TO RETURN TO PROMPT 6 / PROMPT 9 INSTEAD
If the cleaned text reveals missing paragraphs, mistranslations, inconsistent terminology, broken quotes, bad paragraphing, or doctrinal errors, do not keep cleaning mechanically. Return to Prompt 6 for review or Prompt 9 for source-anchored refinement.

SIMPLIFIED / TRADITIONAL CHINESE CONVERSION NOTE
If using the separate Simplified-to-Traditional / Traditional-to-Simplified converter:

1. Treat it as character conversion only, not translation.
2. Check Buddhist technical terms after conversion.
3. Check proper nouns, names, Sanskrit/Pāli/Tibetan transliterations, and quoted classical passages.
4. Do not assume automated conversion preserves regional terminology preferences.
5. Keep a copy of the pre-conversion file.

FINAL STATUS
A cleanup step is complete only after the cleaned file is opened and checked. Do not claim the final text is clean merely because the utility produced an output file.