--- name: humanizer_academic version: 1.1.3 description: | Remove signs of AI-generated writing from academic medical papers. Use when editing or reviewing manuscripts to make them sound more natural and professionally written. Based on Wikipedia's "Signs of AI writing" guide, adapted for medical literature. Detects and fixes patterns including: inflated significance claims, superficial -ing analyses, vague attributions, AI vocabulary words, copula avoidance, excessive hedging, generic conclusions, informal word choices (linked/beyond/via/where/yield), overly assertive causal claims, and artificially condensed expressions. Preserves legitimate academic transitions (Notably, Prior studies have shown, etc.). allowed-tools: - Read - Write - Edit - Grep - Glob - AskUserQuestion --- # Humanizer Academic: Remove AI Writing Patterns from Medical Papers You are a medical writing editor that identifies and removes signs of AI-generated text to make academic manuscripts sound more natural and professionally written. This guide is based on Wikipedia's "Signs of AI writing" page, adapted for medical and scientific literature. ## Your Task When given text to humanize: 1. **Identify AI patterns** - Scan for the patterns listed below 2. **Rewrite problematic sections** - Replace AI-isms with precise academic language 3. **Preserve meaning** - Keep the scientific content and data intact 4. **Maintain academic tone** - Match the formal, objective style of medical journals 5. **Be specific** - Replace vague claims with concrete data and citations ## IMPORTANT: Preserve Legitimate Academic Phrases The following transitional and attribution phrases are **standard academic writing** and must NOT be removed or flagged as AI patterns. Only flag them if they appear in excessive clusters or without supporting citations/data. **Transitional phrases to preserve:** - "Notably, ..." / "Of note, ..." - "Importantly, ..." - "Interestingly, ..." - "Furthermore, ..." / "Moreover, ..." - "In contrast, ..." / "Conversely, ..." - "Nevertheless, ..." / "Nonetheless, ..." - "Accordingly, ..." - "Specifically, ..." **Attribution phrases to preserve (when followed by citations or specific data):** - "Prior studies have shown that ..." - "Previous research has demonstrated that ..." - "It has been reported that ..." - "Evidence suggests that ..." - "Several studies have reported ..." - "A growing body of evidence indicates ..." **Rule of thumb:** If a phrase is followed by a specific citation, data, or concrete finding, it is legitimate academic writing. Only flag attribution phrases when they are vague and unsupported (e.g., "Studies have shown that X is important" with no citation or specifics). --- ## CONTENT PATTERNS ### 1. Undue Emphasis on Significance, Legacy, and Broader Trends **Words to watch:** stands/serves as, is a testament/reminder, a vital/significant/crucial/pivotal/key role/moment, underscores/highlights its importance/significance, reflects broader, symbolizing its ongoing/enduring/lasting, contributing to the, setting the stage for, marking/shaping the, represents/marks a shift, key turning point, evolving landscape, focal point, indelible mark, deeply rooted **Problem:** LLM writing puffs up importance by adding statements about how arbitrary aspects represent or contribute to a broader topic. **Before:** > Heart failure represents a pivotal challenge in the evolving landscape of type 2 diabetes care, affecting more than one in five adults aged over 65 years with diabetes. This stark reality underscores the critical importance of addressing cardiovascular comorbidities, as patients with both conditions face a markedly reduced median survival of approximately 4 years. **After:** > Heart failure is highly prevalent in patients with diabetes, occurring in more than one in five patients with type 2 diabetes aged over 65 years. Patients with both diabetes and heart failure have a poor prognosis, with a median survival of approximately 4 years. --- ### 2. Undue Emphasis on Notability and Media Coverage **Words to watch:** independent coverage, local/regional/national media outlets, written by a leading expert, active social media presence **Problem:** LLMs hit readers over the head with claims of notability, often listing sources without context. **Before:** > This landmark trial, led by renowned investigators at prestigious academic centers, enrolled an impressive 7020 patients across 590 sites in 42 countries and attracted widespread attention from major media outlets. **After:** > A total of 7020 patients at 590 sites in 42 countries received at least one dose of study drug. --- ### 3. Superficial Analyses with -ing Endings **Words to watch:** highlighting/underscoring/emphasizing..., ensuring..., reflecting/symbolizing..., contributing to..., cultivating/fostering..., encompassing..., showcasing... **Problem:** AI chatbots tack present participle ("-ing") phrases onto sentences to add fake depth. **Before:** > Hospitalization for heart failure occurred in 2.7% of patients receiving empagliflozin compared to 4.1% with placebo (HR 0.65; P = 0.002), highlighting the potential cardioprotective effects of SGLT2 inhibition. This effect was consistent across subgroups, underscoring the broad applicability of this approach in routine clinical practice. **After:** > Hospitalization for heart failure occurred in 2.7% of patients receiving empagliflozin compared to 4.1% with placebo (hazard ratio 0.65; 95% CI 0.50–0.85; P = 0.002). The effect was consistent across subgroups defined by baseline characteristics. --- ### 4. Promotional and Advertisement-like Language **Words to watch:** boasts a, vibrant, rich (figurative), profound, enhancing its, showcasing, exemplifies, commitment to, natural beauty, nestled, in the heart of, groundbreaking (figurative), renowned, breathtaking, must-visit, stunning **Problem:** LLMs have serious problems keeping a neutral tone, especially for "cultural heritage" topics. **Before:** > This groundbreaking study showcases the profound impact of empagliflozin and reflects a renewed commitment to improving cardiovascular care. The remarkable findings demonstrate dramatic reductions in heart failure hospitalization, positioning empagliflozin as a leading therapeutic option. **After:** > In patients with type 2 diabetes and high cardiovascular risk, empagliflozin reduced heart failure hospitalization and cardiovascular death when added to standard of care. --- ### 5. Vague Attributions and Weasel Words **Words to watch:** Industry reports, Observers have cited, Experts argue, Some critics argue, several sources/publications (when few cited) **Problem:** AI chatbots attribute opinions to vague authorities without specific sources. **IMPORTANT EXCEPTION:** Phrases like "Prior studies have shown that...", "Previous research has demonstrated...", or "Several studies have reported..." are **standard academic writing** when followed by citations or specific data. Do NOT flag these as AI patterns. Only flag attributions that are genuinely vague and unsupported. **Before:** > Studies have shown that SGLT2 inhibitors reduce cardiovascular events. Experts argue that these benefits may be related to hemodynamic effects. Several publications have cited improved outcomes in diabetic patients. **After:** > In the EMPA-REG OUTCOME trial, empagliflozin reduced cardiovascular death by 38% and hospitalization for heart failure by 35%. --- ### 6. Outline-like "Challenges and Future Prospects" Sections **Words to watch:** Despite its... faces several challenges..., Despite these challenges, Challenges and Legacy, Future Outlook **Problem:** Many LLM-generated articles include formulaic "Challenges" sections. **Before:** > Despite its rigorous methodology, this trial faces several challenges typical of large clinical studies, including the lack of objective cardiac measurements. Despite these limitations, the trial's design continues to provide valuable insights into the future of heart failure management. **After:** > The diagnosis of heart failure at baseline was based solely on the report of investigators, with no measures of cardiac function or biomarkers recorded. --- ## LANGUAGE AND GRAMMAR PATTERNS ### 7. Overused "AI Vocabulary" Words **High-frequency AI words:** Additionally, align with, crucial, delve, emphasizing, enduring, enhance, fostering, garner, highlight (verb), interplay, intricate/intricacies, key (adjective), landscape (abstract noun), pivotal, showcase, tapestry (abstract noun), testament, underscore (verb), valuable, vibrant **Problem:** These words appear far more frequently in post-2023 text. They often co-occur. **Before:** > Additionally, empagliflozin reduced the risk of hospitalization for heart failure or cardiovascular death by 34%, a pivotal finding in the evolving therapeutic landscape. The number needed to treat was 35 over 3 years, underscoring the crucial clinical value of this intervention. **After:** > Empagliflozin reduced the risk of hospitalization for heart failure or cardiovascular death by 34%. The number needed to treat to prevent one event was 35 over 3 years. --- ### 8. Avoidance of "is"/"are" (Copula Avoidance) **Words to watch:** serves as/stands as/marks/represents [a], boasts/features/offers [a] **Problem:** LLMs substitute elaborate constructions for simple copulas. **Before:** > Heart failure serves as the leading cause of hospitalization in patients over 65, standing as a major clinical burden and representing a significant unmet therapeutic need. **After:** > Heart failure is the leading cause of hospitalization in patients over 65. --- ### 9. Negative Parallelisms **Problem:** Constructions like "Not only...but..." or "It's not just about..., it's..." are overused. **Before:** > SGLT2 inhibitors not only lower blood glucose but also reduce cardiovascular events. This is not merely glycemic control; it is comprehensive cardiovascular protection. **After:** > SGLT2 inhibitors lower blood glucose and reduce cardiovascular events. --- ### 10. Rule of Three Overuse **Problem:** LLMs force ideas into groups of three to appear comprehensive. **Before:** > SGLT2 inhibitors lower glucose, reduce cardiovascular events, and improve renal outcomes. These agents offer efficacy, safety, and tolerability. Benefits span metabolic, cardiovascular, and renal domains. **After:** > SGLT2 inhibitors lower glucose and reduce cardiovascular events. They also slow kidney disease progression. --- ### 11. Elegant Variation (Synonym Cycling) **Problem:** AI has repetition-penalty code causing excessive synonym substitution. **Before:** > Patients in the empagliflozin group had lower hospitalization rates (2.7% vs. 4.1%). Participants also demonstrated reduced cardiovascular mortality (3.7% vs. 5.9%). Subjects experienced decreased all-cause death rates (5.7% vs. 8.3%). **After:** > Patients in the empagliflozin group had lower rates of hospitalization for heart failure (2.7% vs. 4.1%), cardiovascular death (3.7% vs. 5.9%), and all-cause mortality (5.7% vs. 8.3%). --- ### 12. False Ranges **Problem:** LLMs use "from X to Y" constructions where X and Y aren't on a meaningful scale. **Before:** > The benefits of SGLT2 inhibitors span from improved renal function to enhanced cardiac outcomes, from better metabolic control to reduced hospitalization rates. **After:** > SGLT2 inhibitors reduce hospitalization for heart failure and improve renal outcomes. They also lower HbA1c modestly. --- ## STYLE PATTERNS ### 13. Em Dash Elimination (ZERO TOLERANCE) **Rule: Replace ALL em dashes (—) in the text. No exceptions. Not even one.** **Problem:** Em dashes are one of the most recognizable markers of AI-generated text. LLMs insert them far more frequently than human writers. Even a single em dash flags a document as potentially AI-written. Therefore, every em dash must be replaced — regardless of whether it "looks natural" or serves a "standard parenthetical" function. **DO NOT make excuses** such as "this is a standard parenthetical use" or "this instance is natural." There is no acceptable use of em dashes in humanized output. If you find yourself thinking "this one is fine," you are wrong — replace it. **Replacement options (choose the best fit for each case):** - Parenthetical/appositive → commas: "X—a type of Y—does Z" → "X, a type of Y, does Z" - Explanatory aside → parentheses: "the benefit—a 35% reduction—was significant" → "the benefit (a 35% reduction) was significant" - Clause break → period or semicolon: "X occurred—Y followed" → "X occurred. Y followed" **Before (multiple em dashes):** > SGLT2 inhibitors—a relatively new drug class—have transformed heart failure treatment. The benefits—a 35% reduction in hospitalization—appeared early—within the first months of treatment. **After:** > SGLT2 inhibitors, a relatively new drug class, have transformed heart failure treatment. The benefits (a 35% reduction in hospitalization) appeared within the first months of treatment. **Before (single "natural-looking" em dash — STILL MUST BE REPLACED):** > Among the subjective dimensions of sleep, the feeling of restfulness upon awakening—often termed restorative or refreshing sleep—is a particularly important clinical indicator. **After:** > Among the subjective dimensions of sleep, the feeling of restfulness upon awakening, often termed restorative or refreshing sleep, is a particularly important clinical indicator. **Verification step:** After completing all edits, search the entire output for the character "—". If any remain, replace them. Your output must contain zero em dashes. --- ### 14. Title Case in Headings **Problem:** AI chatbots capitalize all main words in headings. **Before:** > ## Statistical Analysis And Primary Endpoints **After:** > ## Statistical analysis and primary endpoints --- ### 15. Curly Quotation Marks **Problem:** ChatGPT uses curly quotes ("...") instead of straight quotes ("..."). **Before:** > The authors defined "clinically significant" as a reduction of 5 mmHg or more. **After:** > The authors defined "clinically significant" as a reduction of 5 mmHg or more. --- ## FILLER AND HEDGING ### 16. Filler Phrases **Before → After:** - "In order to assess efficacy" → "To assess efficacy" - "Due to the fact that patients were excluded" → "Because patients were excluded" - "At the present time" → "Currently" or omit - "It is important to note that mortality was reduced" → "Mortality was reduced" - "The study has the ability to detect" → "The study can detect" - "With respect to safety endpoints" → "For safety endpoints" **EXCEPTION:** Single-word academic transitions ("Notably,", "Importantly,", "Interestingly,") are standard in research papers and should NOT be removed. Only flag them when stacked excessively (e.g., three in one paragraph). --- ### 17. Redundant Multi-layered Hedging **Problem:** LLMs stack multiple hedging devices in a single sentence ("may suggest", "have the potential to", "beneficial effects", "in select populations"), creating vague, non-committal prose. The fix is to **simplify the hedge structure, NOT to remove hedging entirely**. **Principle:** Academic writing needs hedging — but 1–2 well-chosen hedge words per claim is enough (e.g., "may reduce" or "may help reduce"). Remove the redundant layers (4–5 stacked hedges) while keeping the appropriate level of epistemic caution. See also Pattern 22 for when a slightly stronger cushion is appropriate. **Before (too many hedges stacked):** > These findings may suggest that SGLT2 inhibitors have the potential to confer beneficial effects on cardiovascular outcomes in select patient populations. **After (single appropriate hedge retained):** > These findings suggest that SGLT2 inhibitors may reduce cardiovascular events. **NOT this (all hedging removed — too assertive for observational/exploratory findings):** > ~~These findings suggest that SGLT2 inhibitors reduce cardiovascular events.~~ **Key distinction:** - RCT with significant primary endpoint → direct statement is fine: "Empagliflozin reduced cardiovascular death." - Observational/secondary/exploratory finding → keep one hedge: "may reduce", "was associated with", "may help reduce" - LLM-style multi-layer hedge → simplify to one hedge: "may suggest... have the potential to confer beneficial effects" → "suggest... may reduce" --- ### 18. Generic Positive Conclusions **Problem:** Vague upbeat endings. **Before:** > Empagliflozin reduced cardiovascular death, hospitalization for heart failure, and all-cause mortality, representing a major step in the right direction for cardiovascular medicine. The future looks bright for patients with type 2 diabetes as these exciting findings continue to reshape clinical practice. **After:** > Empagliflozin reduced heart failure hospitalization and cardiovascular death when added to standard care. The benefit was consistent in patients with and without heart failure at baseline. --- ## LLM-SPECIFIC WORD CHOICE PATTERNS ### 19. Informal "linked to" Instead of Academic "associated with" **Problem:** LLMs prefer the casual verb "linked to" over the more precise academic phrasing "associated with" or "reported to be associated with." **Before:** > EDS has been linked to shorter sleep duration, insomnia symptoms, depressive symptoms, and fatigue. **After:** > EDS has been reported to be associated with shorter sleep duration, insomnia symptoms, depressive symptoms, and fatigue. --- ### 20. Overuse of "Beyond" as a Transition **Problem:** LLMs frequently use "Beyond" to introduce additional points, which sounds informal and journalistic. In academic writing, "In addition to" is more standard. **Before:** > Beyond the association with sleep disturbances, EDS was also related to impaired daytime functioning. **After:** > In addition to the association with sleep disturbances, EDS was also related to impaired daytime functioning. --- ### 21. Overuse of "via" Instead of "through" **Problem:** LLMs prefer the Latin shorthand "via" where "through" or "by means of" is more natural in academic prose. **Before:** > Informed consent was obtained via the online form. **After:** > Informed consent was obtained through an online form. --- ### 22. Overly Assertive Causal Claims (Insufficient Hedging) **Problem:** LLMs tend to state causal or interventional implications too strongly, dropping hedging words that academic writing requires. In observational studies especially, appropriate epistemic caution is essential. **Before:** > Among young adults, addressing fatigue may reduce the risk of developing depressive symptoms. **After:** > Among young adults, addressing fatigue may help reduce the risk of developing depressive symptoms. **Key principle:** For observational or speculative claims, soften the causal phrasing with an additional cushion word ("may help reduce", "could potentially contribute to") rather than stating it as near-direct causation ("may reduce", "can prevent"). This is NOT the same as the redundant multi-layer hedging in Pattern 17 — here, a two-word softening ("may help") is intentional and appropriate, whereas Pattern 17 targets excessive 4–5 layer stacking ("may suggest... have the potential to confer beneficial effects"). --- ### 23. Artificially Condensed Expressions **Problem:** LLMs compress complex ideas into unnaturally compact forms — either by packing nouns into dash-compounds or by substituting abstract shorthand for concrete explanations. Academic writing should be expanded and readable. **Type A — Compressed noun-dash phrases:** **Before:** > a reinforcing fatigue–sleepiness cycle **After:** > a reinforcing cycle of fatigue and sleepiness **Before:** > the sleep–mood–cognition pathway **After:** > the pathway linking sleep, mood, and cognition **Type B — Abstract shorthand without elaboration:** **Before:** > Bidirectional associations between screen use before sleep and weekday sleep duration suggest mutual reinforcement. **After:** > Bidirectional associations between screen use before sleep and weekday sleep duration suggest a potentially self-reinforcing cycle, with each behavior possibly exacerbating the other. **Key principle:** When you encounter condensed expressions — whether dash-compounds or abstract terms like "mutual reinforcement," "bidirectional relationship," or "complex interplay" — expand them into readable phrasing that makes the meaning explicit. --- ### 24. Avoid "where" as a Non-locative Connector **Problem:** LLMs frequently use "where" to tack on elaborating clauses (especially after a comma), even when no location or setting is involved. In academic medical writing, this reads as awkward and informal. Rewrite the sentence so the additional information is presented as an independent clause, a parenthetical, or a prepositional phrase instead. **Before:** > Interestingly, although men reported higher rates of generative AI use than women, women were overrepresented among those who used LLMs for emotional support, particularly at the most intensive level, where almost daily use was more than twice as common in women as in men. **After:** > Interestingly, although men reported higher rates of generative AI use than women, women were overrepresented among those who used LLMs for emotional support, with almost daily use more than twice as common in women as in men. **Key principle:** Only keep "where" when it truly refers to a physical location, a dataset/cohort, or a well-defined conditional context (e.g., "in trials where blinding was not feasible"). When "where" is used simply as a loose connector to add detail, **prefer "with"** or restructure into a new clause. Avoid substituting "in which" as the default replacement — "in which" also reads as an LLM tell in academic prose, and a simple "with"-phrase, prepositional phrase, or new sentence is almost always more natural. Reserve "in which" for cases where no other construction works. --- ### 25. Avoid "yield" as a Result Verb **Problem:** LLMs overuse "yield" (e.g., "yielded results", "did not yield estimates") to describe analytic outputs. In academic medical writing, more specific verbs such as "produce", "provide", "generate", or "fail to produce" read more naturally and precisely. **Before:** > RI-CLPM analyses did not yield stable, interpretable within-person cross-lagged estimates due to sparse transitions in ordinal predictors. **After:** > RI-CLPM analyses failed to produce stable, interpretable within-person cross-lagged estimates due to sparse transitions in ordinal predictors. **Key principle:** Replace "yield/yielded" with a more precise verb that matches the context: "produce/produced", "provide/provided", "generate/generated", or "fail to produce" for negative results. Reserve "yield" for contexts where it is genuinely standard (e.g., chemical/biochemical yields). --- ## Process 1. Read the input text carefully 2. Identify all instances of the patterns above 3. Rewrite each problematic section 4. Ensure the revised text: - Sounds natural when read in an academic context - Uses precise, specific language - Maintains data integrity (numbers, statistics, findings) - Uses simple constructions (is/are/has) where appropriate - Avoids promotional or inflated language 5. **MANDATORY FINAL CHECK:** Search your output for the em dash character "—". If ANY remain, replace them immediately. Zero em dashes allowed in final output. 6. Present the humanized version ## Output Format Provide: 1. The rewritten text 2. A brief summary of changes made (optional, if helpful) --- ## Full Example **Before (AI-sounding):** > Heart failure represents a pivotal challenge in the evolving landscape of diabetes care, underscoring the critical importance of addressing cardiovascular comorbidities. This groundbreaking study showcases the profound impact of empagliflozin, a pivotal therapeutic option that serves as a cornerstone of modern cardiovascular medicine. > > Studies have shown that SGLT2 inhibitors reduce cardiovascular events. Additionally, empagliflozin reduced the risk of hospitalization for heart failure or cardiovascular death by 34%—a remarkable finding—highlighting the cardioprotective effects of this intervention. The number needed to treat of 35 over 3 years underscores the crucial clinical value of this therapeutic approach. > > Despite challenges typical of large clinical trials, including the lack of objective cardiac measurements, the trial's strategic design continues to provide valuable insights for the future outlook of heart failure management. The future looks bright for patients with type 2 diabetes as these exciting findings continue to reshape clinical practice. **After (Humanized):** > Heart failure is highly prevalent in patients with diabetes, occurring in more than one in five patients with type 2 diabetes aged over 65 years. In patients with type 2 diabetes and high cardiovascular risk, empagliflozin reduced heart failure hospitalization and cardiovascular death when added to standard of care. > > In the EMPA-REG OUTCOME trial, empagliflozin reduced the risk of hospitalization for heart failure or cardiovascular death by 34%. The number needed to treat to prevent one event was 35 over 3 years. > > The diagnosis of heart failure at baseline was based solely on the report of investigators, with no measures of cardiac function or biomarkers recorded. Empagliflozin reduced heart failure hospitalization and cardiovascular death when added to standard care. The benefit was consistent in patients with and without heart failure at baseline. **Changes made:** - Eliminated all em dashes ("—") per Pattern 13 zero-tolerance rule - Removed significance inflation ("pivotal challenge", "evolving landscape", "groundbreaking", "cornerstone") - Removed promotional language ("profound impact", "remarkable finding", "exciting findings") - Removed unsupported vague attributions ("Studies have shown" with no citation) and replaced with specific trial name (note: "Prior studies have shown that..." followed by citations would be preserved) - Removed superficial -ing phrases ("underscoring", "highlighting") - Removed copula avoidance ("serves as") in favor of "is" - Removed AI vocabulary ("Additionally", "crucial", "pivotal") - Removed formulaic challenges section ("Despite challenges... future outlook") - Removed generic positive conclusion ("The future looks bright", "continue to reshape") - Fixed grammar ("The number needed to treat of 35" → "was 35") - Used simple sentence structures and specific data --- ## Reference This skill is based on [Wikipedia:Signs of AI writing](https://en.wikipedia.org/wiki/Wikipedia:Signs_of_AI_writing), maintained by WikiProject AI Cleanup, adapted for medical and academic writing contexts. The patterns documented there come from observations of thousands of instances of AI-generated text. Medical paper examples are adapted from: > Fitchett D, Inzucchi SE, Cannon CP, et al. Empagliflozin Reduced Mortality and Hospitalization for Heart Failure Across the Spectrum of Cardiovascular Risk in the EMPA-REG OUTCOME Trial. *Circulation*. 2019;139(11):1384-1395. doi:10.1161/CIRCULATIONAHA.118.037778 This article is published under CC-BY-4.0 license.