Inference-Time Prompt Projection for Safe Text-to-Image Generation with Total Variation Guarantees > 세미나

본문 바로가기
사이트 내 전체검색


세미나

모드선택 :              
세미나 신청은 모드에서 세미나실 사용여부를 먼저 확인하세요

Inference-Time Prompt Projection for Safe Text-to-Image Generation wit…

김한나 0 1308
구분 박사학위 논문 발표
일정 2026-05-20 16:00 ~ 18:00
강연자 이민혁 (서울대학교)
기타
담당교수 강명주

This thesis investigates how to improve safety in text-to-image generation without unnecessarily degrading benign prompt-image alignment. In text-to-image diffusion models, safety alignment aims to suppress unsafe outputs, while prompt-image alignment requires faithful generation of benign user intent. In practice, stronger safety intervention can improve safety but also distort the underlying conditional generation behavior.

In this thesis, we first formalize this tension through a total variation (TV) perspective. We show that, under a fixed reference conditional generator, any nontrivial reduction in unsafe generations necessarily incurs deviation from the reference distribution, yielding a Safety-Prompt Alignment Trade-off (SPAT). We then propose an inference-only prompt projection framework that selectively rewrites high-risk prompts into a tolerance-controlled safe set while leaving already safe prompts effectively unchanged in practice. To realize this idea, we use a two-stage inference-time cascade in which a large language model proposes candidate rewrites and a vision-language model verifies image-level safety.

Finally, we evaluate the proposed framework on four datasets and three diffusion backbones. Experimental results show that the proposed method consistently improves safety while preserving benign utility near the unaligned reference model. In particular, it achieves 16.7-60.0% relative reductions in inappropriate percentage compared with strong model-level alignment baselines, while maintaining near-reference performance on benign COCO prompts. These results suggest that selective prompt-space intervention provides a practical and theoretically grounded approach to safe text-to-image generation.

세미나명

   

상단으로

Research Institute of Mathematics
서울특별시 관악구 대학동 서울대학교 자연과학대학 129동 305호
Tel. 02-880-6562 / Fax. 02-877-6541 su305@snu.ac.kr

COPYRIGHT ⓒ 자연과학대학 수학연구소 ALL RIGHT RESERVED.