Improving Agent Design, JPEG-LM's Visual Breakthrough, TurboEdit's Real-Time Image Edits, Video Segmentation Advances, LLMs Learning Like Humans, RL Benchmarks
Content provided by PocketPod.
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models
JPEG-LM: LLMs as Image Generators with Canonical Codec Representations
Automated Design of Agentic Systems
TurboEdit: Instant text-based image editing
Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning
Fine-tuning Large Language Models with Human-inspired Learning Strategies in Medical Question Answering
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
70 episodes