[Articles] Know “No” Better: A Data-Driven Approach for Enhancing Negation Awareness in CLIP

Constructed negation dataset generating pipeline and negation testing benchmarks on CLIP..

[Articles] AIVariant: a deep learning-based somatic variant detector for highly contaminated tumor samples

Developed AIVariant, a deep learning model which can detect single nucleotide variant (potential tumor) even in low sequence depth.

[Articles] ProPILE: Probing Privacy Leakage in Large Language Models

Developed ProPILE, which can be used both for LLM users and providers to check Personally Identifiable Information(PII) leakage.

[Articles] ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models

Devised ILVR, which can be used to guide a diffusion model to generate images based on the reference.

[Articles] OPT-OUT: Investigating Entity-Level Unlearning for Large Language Models via Optimal Transport

Developed a new unlearning technique (OPT-OUT), a technique that removes a certain entity’s data without seriously harming model’s performance.