Paper 8
- FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding
- M4I: Multi-modal Models Membership Inference
- AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models
- AdvLogo: Adversarial Patch Attack against Object Detectors based on Diffusion Models
- Scalable Diffusion Models with Transformers
- Don't trust your eyes: on the (un)reliability of feature visualizations
- Please Tell Me More: Privacy Impact of Explainability through the Lens of Membership Inference Attack
- DeepSeek-R1 : Incentivizing Reasoning Capability in LLMs via Reinforcement Learning