Popular repositories Loading
- 
      Cherry_LLMCherry_LLM Public[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models 
- 
      Reflection_TuningReflection_Tuning Public[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning 
- 
      HallusionBenchHallusionBench Public[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models 
- 
      SuperfilteringSuperfiltering Public[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning 
- 
      MoE-EmbeddingMoE-Embedding Public[ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free" 
- 
      MiP-OverthinkingMiP-Overthinking Public[COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill? 
Repositories
-           HallusionBench Public[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models tianyi-lab/HallusionBench’s past year of commit activity 
-           ColorBench Public[NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness tianyi-lab/ColorBench’s past year of commit activity 
-           FaSTAR PublicFast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing tianyi-lab/FaSTAR’s past year of commit activity 
-           Superfiltering Public[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning tianyi-lab/Superfiltering’s past year of commit activity 
-           Cherry_LLM Public[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models tianyi-lab/Cherry_LLM’s past year of commit activity 
-           MiP-Overthinking Public[COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill? tianyi-lab/MiP-Overthinking’s past year of commit activity 
Top languages
Loading…
Most used topics
Loading…