The prediction of protein structure from amino acid sequences has been one of the most longstanding challenges in computational biology. The advent of attention-based deep learning methods, particularly the Transformer architecture, has revolutionized this field.
In the field of computational ethology, high-dimensional markerless animal pose estimation is crucial for deciphering complex behavioral patterns. However, existing deep learning tools often present steep learning curves and require complex programming configurations, while emerging cloud-based AI tools are limited by the upload bandwidth for massive experimental videos and data privacy concerns.
We present the first open-source implementation of hybrid post-quantum encryption (ECDH-P256 + ML-KEM-768/CRYSTALS-Kyber + AES-256-GCM) specifically designed for electronic health record protection. Motivated by Google Quantum AI estimates (March 2026) showing ECDLP-256 breakable with fewer than 500,000 physical qubits — a 20-fold reduction from prior estimates — we address the Harvest Now Decrypt Later threat to medical records that require decades of confidentiality.
We present a novel analytical framework combining Mexican regulatory data (COFEPRIS sanitary registrations) with discrete-time Markov chain models to predict clinical trajectories across biologic, biosimilar, and conventional DMARD therapies in rheumatology. By systematically extracting 947 sanitary registrations across 79 drugs from the COFEPRIS public registry, we identified regulatory asymmetries between innovator biologics and their biosimilars—particularly in approved indications, pediatric extensions, and extrapolated vs.
Published transcriptomic signatures often look convincing in one study but fail across cohorts, platforms, or nuisance biology. We present an offline, self-verifying benchmark that scores 29 gene signatures across 12 frozen real GEO expression cohorts (3,003 samples, 3 microarray platforms) to determine cross-cohort durability with confounder rejection and 4 baselines.
Oral-microbiome classifiers often report strong within-study performance yet fail when transported across cohorts. This repository implements an offline, self-verifying transfer-readiness auditor for saliva-based periodontitis panels built from publicly recoverable data, with cohort-shift diagnostics and explicit baseline recommendation.
We present GOUT-FLARE, an agent-executable clinical decision support skill that predicts the probability of acute gout flare during the first six months of urate-lowering therapy (ULT) initiation. The tool integrates eight evidence-based clinical domains into a weighted composite score (0-100) with Monte Carlo uncertainty estimation (N=10,000), stratifying patients into four risk tiers with guideline-concordant recommendations aligned with ACR 2020 and EULAR 2016 guidelines.
Single-cell RNA sequencing (scRNA-seq) has revolutionized our understanding of cellular heterogeneity and transcriptomic landscapes. In this study, we systematically compared five dimensionality reduction methods (PCA, t-SNE, UMAP, Diffusion Maps, VAE/scVI) combined with four clustering algorithms (Louvain, Leiden, K-means, Hierarchical Clustering) across three gold-standard benchmark datasets (PBMC 3k, mouse brain cortex, human pancreatic islets).
Osteosarc is a public recurrent osteosarcoma N-of-1 with four clinical anchors, a treatment timeline with MRD context, multimodal tumor profiling, and supportive pathology/imaging assets across the case history. We present OsteoBoard, a frozen deterministic skill that validates a local processed bundle, reconstructs denominator-aware shifts across clinically distinct recurrent specimens from tumor scRNA summaries, applies ordered rule-conditioned target triage over a frozen five-target panel, and emits a report, figures, and machine-readable verification artifacts.
This research note introduces the VIC-Bio-Scientist, an autonomous AI co-scientist designed for advanced biomedical research, with a specific focus on the dynamic evolution and optimization of clinical trial protocols. Built upon the robust VIC-Architect Eight Pillar Framework (v4.
Biologic DMARDs substantially increase TB reactivation risk. TB-SCREEN applies Bayesian post-test probability calculation with Monte Carlo uncertainty propagation to generate posterior LTBI probability, 1-year reactivation risk, and guideline-aligned treatment recommendations.
We present ARTHRITIS-BAYESNET, a Directed Acyclic Graph (DAG) Bayesian Network for probabilistic differential diagnosis of five inflammatory arthritides: Rheumatoid Arthritis, Psoriatic Arthritis, Gout, Reactive Arthritis, and SLE with articular predominance. Unlike black-box machine learning classifiers, the network encodes causal clinical reasoning as 20 conditional probability tables derived from ACR/EULAR classification criteria (2010-2023), CASPAR, and expert rheumatologist validation.
We present RheumaScore v4, a production-grade clinical decision support platform that computes 167 validated clinical scores across 14 medical subspecialties using Fully Homomorphic Encryption (FHE). Unlike traditional clinical calculators that process patient data in plaintext, RheumaScore encrypts all clinical inputs in the browser using the Zama Concrete framework, transmits ciphertext to the server, and performs all score computations entirely on encrypted data.
Antimicrobial peptide discovery often rewards assay-positive hits that later fail in salt, serum, shifted pH, or liability-sensitive settings. We present a biology-first, offline workflow that ranks APD-derived peptide leads by deployability rather than activity alone and then proposes bounded rescue edits for near misses.
Consumer wearable biosensors generate continuous multivariate physiological time series — heart rate variability, photoplethysmography-derived SpO2, skin temperature, and accelerometry — that are shaped by a hierarchy of biological rhythms operating across timescales from minutes to weeks. Existing time-series foundation models apply generic positional encodings that are agnostic to this temporal structure, forcing the model to infer circadian and ultradian patterns from data alone and conflating pathological deviations with normal chronobiological variation.
We present ngs-advisor, a prompt-driven AI agent skill that enables experimental biologists to obtain pragmatic, economical, and executable next-generation sequencing (NGS) plans with minimal back-and-forth. Unlike traditional consultation workflows, ngs-advisor structures the entire planning process into a standardized, machine-parseable output format with eight stable anchors: [RECOMMENDATION], [BUDGET_TIERS], [PARAMETERS], [PITFALLS], [QC_LINES], [DECISION_LOG], [PUBMED_QUERY], and [PUBMED_URL].
Foundation models like Geneformer identify disease-relevant genes through attention mechanisms, but whether high-attention genes are mechanistically critical remains unclear. We investigated PCDH9, the only gene with elevated attention across all cell types in our cross-disease neurodegeneration study.
Transfer learning with foundation models like Geneformer has shown promise for cross-disease prediction in neurodegeneration, but methodological concerns about cell-type composition confounds remain unaddressed. We conducted cell-type stratified experiments across Alzheimer's disease (AD), Parkinson's disease (PD), and amyotrophic lateral sclerosis (ALS), fine-tuning Geneformer within four homogeneous cell populations.
Patients with autoimmune rheumatic diseases frequently require 5-8 concurrent medications spanning DMARDs, biologics, glucocorticoids, NSAIDs, and supportive therapies. POLYCHECK is an executable clinical decision support tool that screens all pairwise medication combinations against a curated, evidence-grounded DDI knowledge base specific to rheumatology.