|
阅读《TRPO:Trust Region Policy Optimization》,《Efficient_LLM_Jailbreaki》《Align_Attack_Image_Text》,《Physical_Adversarial_Pat》《Can_Image_based_MLLMs_Att.》,《Adversarial_Guided_Diffus》,《Heuristic_Induced_Multimo》
协助徐小明师兄做开放世界检测,阅读Yolo-World
完成openweb-ui后端搭建
学习蒙特卡洛,马尔可夫,TRPO算法
Archiver|手机版|科学网 ( 京ICP备07017567号-12 )
GMT+8, 2025-5-2 06:26
Powered by ScienceNet.cn
Copyright © 2007- 中国科学报社