- ICH GCP
- US Clinical Trials Registry
- Clinical Trial NCT07427927
Data-driven Clustering in Hemorrhoid Surgery: Retrospective Monocentric Study for the Identification of Clinical Phenotypes (PROCTO-CLUSTER)
Study Overview
Status
Conditions
Intervention / Treatment
Detailed Description
Hemorrhoidal disease presents with heterogeneous symptom patterns, anatomical findings, and operative strategies that are not fully captured by traditional degree-based classifications. This study aims to identify latent, clinically interpretable phenotypes among surgical patients using a fully unsupervised machine learning pipeline applied to routinely collected perioperative data from a high-volume tertiary referral center.
This is a retrospective, observational analysis of de-identified institutional records. The analytic dataset will include routinely documented variables spanning baseline demographics/anthropometrics, symptom profile and relevant clinical history, operative technique and intraoperative descriptors, and routinely captured postoperative follow-up information. Data will be extracted using a predefined data dictionary and standardized preprocessing rules to support reproducibility and reduce variability in variable definitions.
The primary analytic approach will be unsupervised clustering. Variables will be cleaned and standardized prior to modeling. Dimensionality reduction will be performed using t-distributed stochastic neighbor embedding (t-SNE), initialized with principal component analysis to improve stability. Cluster discovery will then be conducted using k-means clustering on the reduced feature space. A range of cluster solutions will be explored, and the final solution will be selected using internal validity metrics (e.g., silhouette-based measures) together with assessment of clinical interpretability. Model robustness will be evaluated through repeated runs across multiple random seeds and key parameter settings to assess stability of cluster assignments.
After cluster assignment, clusters will be characterized using descriptive and comparative statistics to identify variables that most differentiate phenotypes. Post-hoc feature relevance/importance approaches will be used to explore which demographic, clinical, and surgical factors most strongly contribute to cluster formation, with emphasis on effect sizes and clinically meaningful patterns rather than hypothesis-testing alone. Findings will be used to generate hypotheses regarding phenotypes that may be associated with greater operative complexity and different postoperative trajectories, supporting future work on predictive modeling and individualized surgical decision support.
All analyses will be conducted within a controlled institutional environment using validated statistical and data-mining software, with documented parameter settings and version tracking to enable reproducibility. Only de-identified data will be used for analysis, and results will be reported in aggregate to protect patient privacy.
Study Type
Enrollment (Estimated)
Contacts and Locations
Study Locations
-
-
Milan
-
San Donato Milanese, Milan, Italy, 20097
- IRCCS Policlinico San Donato
-
-
Participation Criteria
Eligibility Criteria
Ages Eligible for Study
- Adult
- Older Adult
Accepts Healthy Volunteers
Sampling Method
Study Population
Description
Inclusion criteria
- Age ≥ 18 years
- Clinical and/or intraoperative diagnosis of symptomatic hemorrhoidal disease
- Availability of complete perioperative data: demographic, clinical, surgical, and postoperative variables Exclusion criteria
- Incomplete or missing clinical data
- Presence of anorectal neoplastic conditions (e.g., anal or rectal carcinoma)
- Anorectal surgery within the previous 6 months (to avoid confounding effects on symptoms and anatomy)
Study Plan
How is the study designed?
Design Details
Cohorts and Interventions
Group / Cohort |
Intervention / Treatment |
|---|---|
|
patients who underwent surgery for symptomatic hemorrhoidal disease
Consecutive patients who underwent surgery for symptomatic hemorrhoidal disease at IRCCS Policlinico San Donato between December 2024 and June 2025. Consecutive enrollment was chosen to minimize selection bias and to represent the full spectrum of disease severity in the surgical setting. Inclusion criteria Age ≥ 18 years Clinical and/or intraoperative diagnosis of symptomatic hemorrhoidal disease Availability of complete perioperative data: demographic, clinical, surgical, and postoperative variables Exclusion criteria Incomplete or missing clinical data Presence of anorectal neoplastic conditions (e.g., anal or rectal carcinoma) Anorectal surgery within the previous 6 months (to avoid confounding effects on symptoms and anatomy) |
standard hemorrhoidectomy, advanced hemorrhoidectomy, prolapsectomy, Doppler-guided procedures, or combined techniques
|
What is the study measuring?
Primary Outcome Measures
Outcome Measure |
Measure Description |
Time Frame |
|---|---|---|
|
Internal validity of the unsupervised clustering solution (silhouette coefficient)
Time Frame: From completion of dataset extraction/cleaning through completion of clustering analysis (retrospective analysis of surgeries performed December 2024 to June 2025)
|
Silhouette coefficient of the final k-means clustering solution derived from t-SNE-reduced perioperative data.
The silhouette coefficient will be used as the primary internal validity metric to quantify cluster cohesion and separation for the selected number of clusters.
|
From completion of dataset extraction/cleaning through completion of clustering analysis (retrospective analysis of surgeries performed December 2024 to June 2025)
|
Secondary Outcome Measures
Outcome Measure |
Measure Description |
Time Frame |
|---|---|---|
|
Cluster stability and reproducibility across model runs
Time Frame: From completion of dataset extraction/cleaning through completion of clustering robustness analyses (retrospective analysis of surgeries performed December 2024 to June 2025)
|
Stability of cluster assignments across multiple random seeds and t-SNE parameter settings (including perplexity), summarized by reproducibility/consistency of membership and stability of internal validity metrics across runs.
|
From completion of dataset extraction/cleaning through completion of clustering robustness analyses (retrospective analysis of surgeries performed December 2024 to June 2025)
|
|
Operative duration (proxy of operative complexity)
Time Frame: Intraoperative (day of surgery)
|
Operative duration (minutes) recorded in the operative report/perioperative database; compared across identified phenotypes.
|
Intraoperative (day of surgery)
|
|
Postoperative pain intensity
Time Frame: From surgery to 6 month postoperatively
|
Pain intensity as documented in routine postoperative records/follow-up notes (e.g., numeric rating scale when available or clinician-documented pain status), analyzed as pain trajectory/pattern across early and intermediate follow-up and compared across clusters.
|
From surgery to 6 month postoperatively
|
|
Postoperative complications (Clavien-Dindo classification)
Time Frame: From surgery to 1 month postoperatively (early complications) and up to 6 months postoperatively (late complications)
|
Any postoperative complication recorded in routine follow-up, graded according to the Clavien-Dindo system; complication rates and severity compared across clusters.
|
From surgery to 1 month postoperatively (early complications) and up to 6 months postoperatively (late complications)
|
|
Time to return to routine activities
Time Frame: From surgery to 1 month postoperatively
|
Time to return to routine activities/work when documented in follow-up notes; compared across phenotypes.
|
From surgery to 1 month postoperatively
|
|
Recurrence
Time Frame: 1 month and 6 months postoperatively
|
Recurrence patterns or persistence/return of hemorrhoid-related symptoms (e.g., bleeding/prolapse/other symptoms) as documented in routine follow-up and need for re-evaluation or additional intervention; compared across clusters.
|
1 month and 6 months postoperatively
|
Other Outcome Measures
Outcome Measure |
Measure Description |
Time Frame |
|---|---|---|
|
Cluster separation metrics (beyond silhouette)
Time Frame: From completion of dataset extraction/cleaning through completion of clustering analysis (retrospective analysis of surgeries performed December 2024 to June 2025)
|
Additional internal cluster separation metrics (e.g., measures of between-cluster separation/within-cluster dispersion as implemented in the analytic workflow) reported to support interpretability of the phenotype solution.
|
From completion of dataset extraction/cleaning through completion of clustering analysis (retrospective analysis of surgeries performed December 2024 to June 2025)
|
|
Between-cluster differences in clinical/anatomical/surgical characteristics
Time Frame: Baseline (preoperative assessment) and intraoperative (day of surgery)
|
Differences across clusters in routinely collected demographic and clinical history variables (e.g., age, sex, BMI, comorbidity burden, medications, symptom profile, bowel habit characteristics), anatomical descriptors (when documented), and procedure type/technique selection.
|
Baseline (preoperative assessment) and intraoperative (day of surgery)
|
|
Post-hoc feature relevance for cluster formation
Time Frame: From completion of dataset extraction/cleaning through completion of post-hoc feature relevance analyses (retrospective analysis of surgeries performed December 2024 to June 2025)
|
Relative contribution/importance of demographic, clinical, and surgical variables to cluster formation assessed using post-hoc feature relevance approaches; used to interpret drivers of phenotype structure.
|
From completion of dataset extraction/cleaning through completion of post-hoc feature relevance analyses (retrospective analysis of surgeries performed December 2024 to June 2025)
|
Collaborators and Investigators
Sponsor
Publications and helpful links
General Publications
- Bernabé-Díaz JA, Franco M, Vivo JM, Fernández-Breis JT. Optimizing clustering-based analytical methods with trimmed and sparse clustering. Computers in Biology and Medicine. 2025;194:110436. doi:10.1016/j.compbiomed.2025.110436
- Xiao C, Hong S, Huang W. Optimizing graph layout by t-SNE perplexity estimation. Int J Data Sci Anal. 2023;15(2):159-171. doi:10.1007/s41060-022-00348-7
- Goldenholz, D.M.; Sun, H.; Ganglberger, W.; Westover, M.B. Sample Size Analysis for Machine Learning Clinical Validation Studies. Biomedicines 2023, 11, 685. https://doi.org/10.3390/ biomedicines11030685
- De Marco S, Tiso D. Lifestyle and Risk Factors in Hemorrhoidal Disease. Front Surg. 2021 Aug 18;8:729166. doi: 10.3389/fsurg.2021.729166. eCollection 2021.
- Robinson PN, Mungall CJ, Haendel M. Capturing phenotypes for precision medicine. Cold Spring Harb Mol Case Stud. 2015 Oct;1(1):a000372. doi: 10.1101/mcs.a000372.
- Brillantino A, Renzi A, Talento P, Brusciano L, Marano L, Grillo M, Maglio MN, Foroni F, Palumbo A, Sotelo MLS, Vicenzo L, Lanza M, Frezza G, Antropoli M, Gambardella C, Monaco L, Ferrante I, Izzo D, Giordano A, Pinto M, Fantini C, Gasparrini M, Schiano Di Visconte M, Milazzo F, Ferreri G, Braini A, Cocozza U, Pezzatini M, Gianfreda V, Di Leo A, Landolfi V, Favetta U, Agradi S, Marino G, Varriale M, Mongardini M, Pagano CEFA, Contul RB, Gallese N, Ucchino G, D'Ambra M, Rizzato R, Sarzo G, Masci B, Da Pozzo F, Ascanelli S, Liguori P, Pezzolla A, Iacobellis F, Boriani E, Cudazzo E, Babic F, Geremia C, Bussotti A, Cicconi M, Sarno AD, Mongardini FM, Brescia A, Lenisa L, Mistrangelo M, Zuin M, Mozzon M, Chiriatti AP, Bottino V, Ferronetti A, Rispoli C, Carbone L, Calabro G, Tirro A, de Vito D, Ioia G, Lamanna GL, Asciore L, Greco E, Bianchi P, D'Oriano G, Stazi A, Antonacci N, Renzo RMD, Poto GE, Ferulano GP, Longo A, Docimo L. The Italian Unitary Society of Colon-Proctology (Societa Italiana Unitaria di Colonproctologia) guidelines for the management of acute and chronic hemorrhoidal disease. Ann Coloproctol. 2024 Aug;40(4):287-320. doi: 10.3393/ac.2023.00871.0124. Epub 2024 Aug 30.
- De Gregorio MA, Guirola JA, Serrano-Casorran C, Urbano J, Gutierrez C, Gregorio A, Sierre S, Ciampi-Dopazo JJ, Bernal R, Gil I, De Blas I, Sanchez-Ballestin M, Millera A. Catheter-directed hemorrhoidal embolization for rectal bleeding due to hemorrhoids (Goligher grade I-III): prospective outcomes from a Spanish emborrhoid registry. Eur Radiol. 2023 Dec;33(12):8754-8763. doi: 10.1007/s00330-023-09923-3. Epub 2023 Jul 17.
- van Oostendorp JY, Grossi U, Hoxhaj I, Kimman ML, Kuiper SZ, Breukink SO, Han-Geurts IJM, Gallo G. Limitations of the Goligher classification in randomized trials for hemorrhoidal disease: a qualitative systematic review of selection criteria. Tech Coloproctol. 2025 Jun 10;29(1):133. doi: 10.1007/s10151-025-03170-y.
- Dekker L, Han-Geurts IJM, Grossi U, Gallo G, Veldkamp R. Is the Goligher classification a valid tool in clinical practice and research for hemorrhoidal disease? Tech Coloproctol. 2022 May;26(5):387-392. doi: 10.1007/s10151-022-02591-3. Epub 2022 Feb 9.
- Bozovic B, Radoicic M, Jankovic S, Andelkovic J, Kostic M. Pharmacoeconomic Aspects of Treating Hemorrhoidal Disease-Cost of Illness Study Based on Data from Balkan Country with Recent History of Social and Economic Transition. Iran J Public Health. 2021 Jun;50(6):1288-1290. doi: 10.18502/ijph.v50i6.6433. No abstract available.
- Rorvik HD, Davidsen M, Gierloff MC, Brandstrup B, Olaison G. Quality of life in patients with hemorrhoidal disease. Surg Open Sci. 2023 Feb 24;12:22-28. doi: 10.1016/j.sopen.2023.02.004. eCollection 2023 Mar.
- Pata F, Gallo G, Pellino G, Vigorita V, Podda M, Di Saverio S, D'Ambrosio G, Sammarco G. Evolution of Surgical Management of Hemorrhoidal Disease: An Historical Overview. Front Surg. 2021 Aug 30;8:727059. doi: 10.3389/fsurg.2021.727059. eCollection 2021.
- Roberts K. What Are Hemorrhoids? JAMA. 2025 Nov 6. doi: 10.1001/jama.2025.17253. Online ahead of print.
- Wang L, Ni J, Hou C, Wu D, Sun L, Jiang Q, Cai Z, Fan W. Time to change? Present and prospects of hemorrhoidal classification. Front Med (Lausanne). 2023 Oct 11;10:1252468. doi: 10.3389/fmed.2023.1252468. eCollection 2023.
- Al-Masoudi RO, Shosho R, Alquhra D, Alzahrani M, Hemdi M, Alshareef L. Prevalence of Hemorrhoids and the Associated Risk Factors Among the General Adult Population in Makkah, Saudi Arabia. Cureus. 2024 Jan 3;16(1):e51612. doi: 10.7759/cureus.51612. eCollection 2024 Jan.
- Error in Figure 4 and References. JAMA. 2025 Oct 7;334(13):1203. doi: 10.1001/jama.2025.17752. No abstract available.
Study record dates
Study Major Dates
Study Start (Actual)
Primary Completion (Actual)
Study Completion (Estimated)
Study Registration Dates
First Submitted
First Submitted That Met QC Criteria
First Posted (Actual)
Study Record Updates
Last Update Posted (Actual)
Last Update Submitted That Met QC Criteria
Last Verified
More Information
Terms related to this study
Additional Relevant MeSH Terms
Other Study ID Numbers
- CET 18-2026
Plan for Individual participant data (IPD)
Plan to Share Individual Participant Data (IPD)?
Drug and device information, study documents
Studies a U.S. FDA-regulated drug product
Studies a U.S. FDA-regulated device product
This information was retrieved directly from the website clinicaltrials.gov without any changes. If you have any requests to change, remove or update your study details, please contact register@clinicaltrials.gov. As soon as a change is implemented on clinicaltrials.gov, this will be updated automatically on our website as well.
Clinical Trials on Hemorrhoid Prolapse
-
Renmin Hospital of Wuhan UniversityThe First Affiliated Hospital of Nanchang University; Zhejiang University; Hubei... and other collaboratorsNot yet recruitingHemorrhoids | Hemorrhoid Prolapse | Hemorrhoid | Hemorrhoid Bleeding | Hemorrhoid Pain | Hemorrhoids InternalChina
-
Cardarelli HospitalNeopharmed Gentili S.p.A.RecruitingHemorrhoid | Hemorrhoid Bleeding | Hemorrhoids Prolapse | Hemorrhoid Pain | Topical AdministrationItaly
-
Faming ZhangRecruiting
-
Università degli Studi dell'AquilaCompletedHemorrhoids | Hemorrhoid ProlapseItaly
-
The Second Hospital of Nanjing Medical UniversityRecruitingRectal Prolapse | Internal HemorrhoidChina
-
Fondation Hôpital Saint-JosephRecruiting
-
Medtronic - MITGCompleted
-
The University of Hong KongSchool of Chinese Medicine, The University of Hong KongNot yet recruitingAcupuncture | Hemorrhoidectomy | Hemorrhoid Pain
-
Cardiovascular and Interventional Radiological...Medtronic; Cook Group IncorporatedNot yet recruitingHemorrhoid | Hemorrhoid Bleeding
-
MiMedx Group, Inc.CompletedExternal Hemorrhoid | External Hemorrhoid ThrombosedUnited States
Clinical Trials on Any surgical procedure for hemorrhoidal disease
-
Western Galilee Hospital-NahariyaCompleted
-
University Hospital, AkershusTerminatedSymptomatic Haemorrhoidal DiseaseNorway
-
Linkoeping UniversityRegion ÖstergötlandActive, not recruiting
-
Karadeniz Technical UniversityActive, not recruitingAnterior Cruciate Ligament RuptureTurkey (Türkiye)
-
Clinical Hospital Center RijekaNot yet recruitingCoronary Artery Disease | Ischemic Heart Disease | Stable Angina | Unstable Angina | NSTEMI - Non-ST Segment Elevation MI
-
University of PalermoCompleted
-
University of BolognaRecruitingColorectal Neoplasms MalignantItaly
-
Philipps University MarburgGerman Cancer AidRecruitingCancer of Endometrium Stage I | Cancer of Endometrium Stage IIJapan, Germany, Korea, Republic of
-
Ohio State University Comprehensive Cancer CenterCompletedCervical Carcinoma | Human Papillomavirus InfectionUnited States
-
Boston Scientific CorporationNot yet recruitingExtracranial Carotid Artery Disease | TCAR | Transcarotid Artery Revascularization