Jiaqi Wang's Homepage
Jiaqi Wang's Homepage
Home
Publications
Projects
Honors and Awards
Light
Dark
Automatic
Publications
Type
Conference paper
Journal article
Date
2024
2023
2022
2021
2020
2019
2018
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
PDF
Cite
Code
Long-CLIP: Unlocking the Long-Text Capability of CLIP
PDF
Cite
Code
RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition
PDF
Cite
Code
SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation
PDF
Cite
Code
Project
DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language Models
PDF
Cite
Code
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
PDF
Cite
Code
GPT4Point: A Unified Framework for Point-Language Understanding and Generation
PDF
Cite
Code
Project
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
PDF
Cite
Code
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
PDF
Cite
Code
Project
ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
PDF
Cite
Code
Project
OneLLM: One Framework to Align All Modalities with Language
PDF
Cite
Code
Project
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition
PDF
Cite
Code
Project
VIGC: Visual Instruction Generation and Correction
PDF
Cite
Code
Project
HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image
PDF
Cite
Code
Project
V3Det: Vast Vocabulary Visual Detection Dataset
PDF
Cite
Code
Project
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers
PDF
Cite
Code
Project
OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation
PDF
Cite
Code
Project
Multi-level Logit Distillation
PDF
Cite
Code
BUOL: A Bottom-Up Framework with Occupancy-aware Lifting for Panoptic 3D Scene Reconstruction From A Single Image
PDF
Cite
Code
Dense Distinct Query for End-to-End Object Detection
PDF
Cite
Code
Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction
PDF
Cite
Code
Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences
PDF
Cite
Code
Semi-Supervised Semantic Segmentation via Gentle Teaching Assistant
PDF
Cite
Code
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
PDF
Cite
Code
Few-Shot Object Detection via Association and DIscrimination
PDF
Cite
Code
Texture Memory-Augmented Deep Patch-Based Image Inpainting
PDF
Cite
Code
CARAFE++: Unified Content-Aware ReAssembly of FEatures
PDF
Cite
Code
Seesaw Loss for Long-Tailed Instance Segmentation
PDF
Cite
Code
Side-Aware Boundary Localization for More Precise Object Detection
PDF
Cite
Code
CARAFE: Content-Aware ReAssembly of FEatures
PDF
Cite
Code
Region Proposal by Guided Anchoring
PDF
Cite
Code
Hybrid Task Cascade for Instance Segmentation
PDF
Cite
Code
Optimizing Video Object Detection via a Scale-Time Lattice
PDF
Cite
Cite
×