name: manuscript-audit
description: "Audit and polish scientific manuscripts for journal submission. Use this skill whenever the user has a manuscript draft (docx, pdf, or pasted text) for peer review. The skill performs four passes: (1) extracts all author-year citations and verifies claims against original PDFs in the user's Zotero library to ensure faithfulness, consulting the manuscript's reference list to disambiguate author-year citations; (2) identifies unsupported or weakly supported claims and suggests additional citations via semantic search of the library; (3) checks for logical inconsistencies, contradictions, and reasoning gaps within the manuscript; (4) copyedits for grammar, style, punctuation, flow, and journal conventions. Trigger on phrases like 'audit my manuscript,' 'review my citations,' 'fact-check my claims,' 'check my paper,' or 'prepare for submission.' (Claude Code only; uses litmap for semantic citation-gap detection.)"
compatibility: "Claude Code only. Requires zotero skill, pdf-reading skill, file-reading skill (if manuscript uploaded as file), and a working `uv run litmap` install at ~/src/Cowork/litmap."

Manuscript Audit for Scientific Journal Submission

⚠️ Runtime: Claude Code only. This skill calls uv run litmap … against ~/LitLake/embeddings.db on the local machine. It will not work in the Cowork web sandbox. If you reached this skill from the Cowork web frontend, stop and switch to Claude Code.

A four-stage workflow to verify citations against sources, identify evidence gaps, detect logical flaws, and polish a manuscript draft before submission.

Stage 1: Citation Extraction & Faithfulness Audit

Input

Manuscript (docx, pdf, or pasted text)
User's Zotero library (via zotero skill)

Process

4b. Locate PDFs in Zotero: Query the Zotero SQLite database directly rather than grepping filenames. Filename-based search is unreliable for two reasons: (a) auto-generated filenames may not include the author name at all (e.g., an organisation such as "Nature Positive Initiative" may be saved under the document title), and (b) overly broad filename patterns return hundreds of false positives, making truncation errors likely. Instead, run a Python query against zotero.sqlite:

Substitute the author's last name and year from the reference list. For multi-word organisation names (e.g. "Nature Positive Initiative"), match on a distinctive fragment of the name, such as lastName LIKE '%Nature Positive%', or search by title keywords instead. The path field returned has the form storage:filename.pdf; to get the full path, strip the storage: prefix and prepend /mnt/Zotero/storage/<key>/, where <key> is the item key. If multiple results are returned for the same author and year, use the title from the reference list entry to select the correct one.
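A minimal sketch of the lookup described above. It assumes the standard Zotero schema (tables items, creators, itemCreators, itemData, itemDataValues, fields, itemAttachments) and that the stored date value begins with the four-digit year. The names find_pdfs and STORAGE_ROOT are illustrative and are not part of the zotero skill.

```python
import sqlite3
from pathlib import Path

# Storage root taken from the text above; adjust for the local machine.
STORAGE_ROOT = Path("/mnt/Zotero/storage")

# Assumed schema: parent item -> creators and date field -> PDF attachment.
QUERY = """
SELECT DISTINCT att.key, ia.path
FROM items parent
JOIN itemCreators ic    ON ic.itemID = parent.itemID
JOIN creators c         ON c.creatorID = ic.creatorID
JOIN itemData d         ON d.itemID = parent.itemID
JOIN fields f           ON f.fieldID = d.fieldID AND f.fieldName = 'date'
JOIN itemDataValues dv  ON dv.valueID = d.valueID
JOIN itemAttachments ia ON ia.parentItemID = parent.itemID
JOIN items att          ON att.itemID = ia.itemID
WHERE c.lastName LIKE ?
  AND dv.value LIKE ?
  AND ia.contentType = 'application/pdf'
"""


def find_pdfs(conn: sqlite3.Connection, last_name: str, year: int) -> list[Path]:
    """Return full paths of PDF attachments matching author fragment and year."""
    rows = conn.execute(QUERY, (f"%{last_name}%", f"{year}%")).fetchall()
    paths = []
    for attachment_key, path in rows:
        # Only stored files use the "storage:" prefix; skip linked files.
        if path and path.startswith("storage:"):
            filename = path[len("storage:"):]
            paths.append(STORAGE_ROOT / attachment_key / filename)
    return paths
```

Zotero keeps a lock on zotero.sqlite while it is running, so it may be safer to open the database read-only or to query a copy. If find_pdfs returns more than one path, compare titles against the reference list entry before choosing.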

Important: Zotero PDF filenames are generated automatically and may contain errors — misspelled author names, wrong years, truncated titles. Never use a filename mismatch as evidence of a citation error in the manuscript. Always resolve ambiguity by matching against the full reference list entry (title, journal, DOI), not the filename alone.

4c. Check for reading notes: Once a Zotero item is matched, query any attached reading notes as a secondary source before opening the PDF:

Use PDF summary sections (content before <hr/>) to quickly locate passages relevant to the claim; they save time when skimming a long PDF. Treat them as helpful but non-authoritative, and always confirm any finding against the PDF itself.

Use DY sections (content after <hr/>) as context only for understanding how the paper was intended to be used. Never cite DY content in a faithfulness verdict.
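A rough sketch of how the two note sections could be separated. It assumes reading notes live in Zotero's itemNotes table as HTML in the note column, keyed by parentItemID, and that the first horizontal rule divides the PDF summary from the DY section. The function names are illustrative.

```python
import re
import sqlite3

# Matches <hr>, <hr/>, and <hr /> in any letter case.
HR_PATTERN = re.compile(r"<hr\s*/?>", re.IGNORECASE)


def split_reading_note(note_html: str) -> tuple[str, str]:
    """Split a note into (pdf_summary, dy_section) at the first <hr/>."""
    parts = HR_PATTERN.split(note_html, maxsplit=1)
    summary = parts[0].strip()
    # A note without a rule is treated as summary only.
    dy_section = parts[1].strip() if len(parts) > 1 else ""
    return summary, dy_section


def fetch_reading_notes(conn: sqlite3.Connection, parent_item_id: int) -> list[tuple[str, str]]:
    """Return (summary, dy_section) pairs for every note attached to an item."""
    rows = conn.execute(
        "SELECT note FROM itemNotes WHERE parentItemID = ?", (parent_item_id,)
    ).fetchall()
    return [split_reading_note(note or "") for (note,) in rows]
```

Keeping the two halves in separate return values makes it harder to quote DY content in a verdict by accident: only the summary half should feed the passage search, and even that must be confirmed against the PDF.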

In the Stage 1 output, add a Notes subsection where reading notes exist:

Extract and verify: For each PDF found:
- Extract full text using pdf-reading skill
- Search for keywords from the claim (nouns, key concepts)
- Extract relevant passages (context: 2–3 sentences before/after the keyword)
- Compare the claim in the manuscript to the passage in the PDF
- Record verdict: ✓ Faithful (claim matches source), ⚠ Overstatement (claim goes beyond source), ✗ Not found (no matching passage found), or Partial (only part of the claim is supported)
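The keyword search and passage extraction steps above can be sketched as follows. This is an illustration only: it uses a naive punctuation-based sentence splitter and plain substring matching, and the function names are hypothetical. The comparison and verdict still require reading the passage.

```python
import re


def split_sentences(text: str) -> list[str]:
    """Naive splitter: break after ., !, or ? when followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def find_passages(text: str, keywords: list[str], context: int = 2) -> list[str]:
    """Return each keyword hit with `context` sentences on either side."""
    sentences = split_sentences(text)
    lowered = [k.lower() for k in keywords]
    windows: list[list[int]] = []  # [start, end) sentence index pairs
    for i, sentence in enumerate(sentences):
        if any(k in sentence.lower() for k in lowered):
            start = max(0, i - context)
            end = min(len(sentences), i + context + 1)
            if windows and start <= windows[-1][1]:
                # Overlapping or adjacent hit: extend the previous window.
                windows[-1][1] = max(windows[-1][1], end)
            else:
                windows.append([start, end])
    return [" ".join(sentences[s:e]) for s, e in windows]
```

Merging overlapping windows avoids reporting the same sentences several times when a keyword recurs within a paragraph. Abbreviations such as "et al." will split sentences early, so treat the window size as approximate.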

5b. Check for unquoted verbatim phrases: While the PDF text is in hand, also scan the manuscript sentence(s) surrounding this citation for word-for-word borrowing from the source that is not enclosed in quotation marks. A run of 5 or more consecutive words appearing identically in both texts is a strong signal; 4 words is worth flagging if the phrasing is distinctive (e.g. a notable characterisation like "notoriously difficult").

Flag any match with the verdict ⚠ Unquoted verbatim phrase and report:

The matched phrase
The original sentence in the manuscript
The source sentence from the PDF
A suggested fix: either wrap in quotation marks and add a page reference, or paraphrase
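One way to surface candidate runs for step 5b is to compare word n-grams between the manuscript sentence and the source text. The sketch below is an assumption about how this could be done, not a prescribed method; shared_runs and tokenize are illustrative names. It ignores case and punctuation, so every hit should still be checked by eye against both texts.

```python
import re


def tokenize(text: str) -> list[str]:
    """Lowercase word tokens; punctuation is dropped."""
    return re.findall(r"[a-z0-9']+", text.lower())


def shared_runs(manuscript: str, source: str, min_words: int = 5) -> list[str]:
    """Return runs of at least `min_words` consecutive words found in both texts."""
    m = tokenize(manuscript)
    s = tokenize(source)
    source_ngrams = {
        tuple(s[i:i + min_words]) for i in range(len(s) - min_words + 1)
    }
    runs = []
    i = 0
    while i <= len(m) - min_words:
        if tuple(m[i:i + min_words]) in source_ngrams:
            j = i + min_words
            # Extend the run while each successive window also occurs in the
            # source. Windows are chained, so very long runs are approximate.
            while j < len(m) and tuple(m[j - min_words + 1:j + 1]) in source_ngrams:
                j += 1
            runs.append(" ".join(m[i:j]))
            i = j
        else:
            i += 1
    return runs
```

Calling shared_runs with min_words=4 gives the looser check; those shorter matches are worth flagging only when the phrasing is distinctive.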

Report format:

Missing PDFs: Before flagging a paper as absent, always: (a) search Zotero using variant spellings, partial author names, and keywords from the title; (b) check the manuscript's full reference list (which may be in a separate document if the PDF says "reference list located elsewhere") for the complete citation details, then re-search. Only flag as "PDF not found in library" after both steps have been attempted. When flagging, include the full reference entry from the reference list so the user can verify.

Output Format

For each citation, report:

Alternatively for overstatement:

Stage 2: Citation Gap Detection (Semantic)

Calls litmap search against the user's local embeddings database. Requires Claude Code runtime — see banner above.

Input

Manuscript (docx, pdf, or pasted text)
Reference list built in Stage 1
User's Zotero library (read via the zotero skill)
(Optional) --collection scope if the user named one

Procedure

Failure modes

Cluster step fails: skip silently, proceed to per-claim search.
LanceDB unavailable: Make sure you are passing the correct database path ~/.omnimind/lancedb.

Stage 3: Logical Consistency Check

Input

The manuscript (full text)
Citation audit and gap analysis from Stages 1–2

Process

Output Format

Stage 4: Copyediting Pass

Input

The full manuscript

Process

Output Format

Integration & Output

Run all four stages in sequence. Deliver:

Dependencies

zotero skill — to query the Zotero library and retrieve PDFs
pdf-reading skill — to extract text from PDFs and verify claims
file-reading skill (if docx/pdf uploaded) — to read the manuscript
Standard regex and text processing (built-in)

Workflow Notes

Timing: Expect 10–20 minutes for an 8,000-word manuscript, depending on citation count and PDF availability.
Zotero integration: Assumes Doug's Zotero library is accessible at /mnt/Zotero/ (Cowork environment) or via the zotero skill.
Citation format: Works with author-year citations (Smith 2020, Jones et al. 2019). Numbered citations [1], [2] require different parsing; confirm format before beginning.
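To confirm the citation format before beginning, a quick regex pass over the manuscript text can help. The patterns below are a rough sketch under stated assumptions: they cover single-author, two-author, and "et al." citations with four-digit years from 1900 to 2099, and they will produce false positives on phrases such as "In 2020". All names are illustrative.

```python
import re

# Surname, optionally followed by "et al." or "and/& Surname", then a year.
AUTHOR_YEAR = re.compile(
    r"\b([A-Z][A-Za-z'\-]+(?:\s+(?:et al\.?|(?:and|&)\s+[A-Z][A-Za-z'\-]+))?)"
    r",?\s+\(?((?:19|20)\d{2}[a-z]?)\b"
)
# Bracketed numbers such as [1], [2, 3], or [4-6].
NUMBERED = re.compile(r"\[\d+(?:\s*[,-]\s*\d+)*\]")


def extract_author_year(text: str) -> list[tuple[str, str]]:
    """Return (author, year) pairs in order of appearance."""
    return AUTHOR_YEAR.findall(text)


def detect_citation_style(text: str) -> str:
    """Guess the dominant style: 'author-year', 'numbered', or 'unknown'."""
    author_year = len(AUTHOR_YEAR.findall(text))
    numbered = len(NUMBERED.findall(text))
    if author_year == 0 and numbered == 0:
        return "unknown"
    return "author-year" if author_year >= numbered else "numbered"
```

If detect_citation_style returns "numbered" or "unknown", confirm the format with the user before starting Stage 1.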
