Red Teaming o1 Part 1/2 – Automated Jailbreaking with Haize Labs' Leonard Tang, Aidan Ewart, and Brian Huang
"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis
In this Emergency Pod of The Cognitive Revolution, Nathan provides crucial insights into OpenAI's new o1 and o1-mini reasoning models. Featuring exclusive interviews with members of the o1 Red Team from Apollo Research and Haize Labs, we explore the models' capabilities, safety profile, and OpenAI's pre-release testing approach. Dive into the implications of these advanced AI systems, including their potential to match or exceed expert performance in many areas. Join us for an urgent and informative discussion on the latest developments in AI technology and their impact on the future.
- o1 Safety Card
- Haize Labs
- Endless Jailbreaks with Bijection Learning: a Powerful, Scale-Agnostic Attack Method
- Haize Labs Job board
Apply to join over 400 Founders and Execs in the Turpentine Network: https://www.turpentinenetwork.co/
SPONSORS:
Oracle: Oracle Cloud Infrastructure (OCI) is a single platform for your infrastructure, database, application development, and AI needs. OCI has four to eight times the bandwidth of other clouds, offers one consistent price, and nobody does data better than Oracle. If you want to do more and spend less, take a free test drive of OCI at https://oracle.com/cognitive
Brave: The Brave Search API can be used to assemble a dataset to train your AI models and to help with retrieval augmentation at inference time, all while remaining affordable with developer-first pricing. Integrating the Brave Search API into your workflow translates to more ethical data sourcing and more human-representative datasets. Try the Brave Search API for free for up to 2,000 queries per month at https://bit.ly/BraveTCR
Omneky: Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work, customized across all platforms, with the click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off: https://www.omneky.com/
Squad: Access global engineering without the headache and at a fraction of the cost: head to https://choosesquad.com/ and mention "Turpentine" to skip the waitlist.
RECOMMENDED PODCAST:
This Won't Last.
Eavesdrop on Keith Rabois, Kevin Ryan, Logan Bartlett, and Zach Weinberg's monthly backchannel. They unpack their hottest takes on the future of tech, business, venture, investing, and politics.
Apple Podcasts: https://podcasts.apple.com/us/podcast/id1765665937
Spotify: https://open.spotify.com/show/2HwSNeVLL1MXy0RjFPyOSz
YouTube: https://www.youtube.com/@ThisWontLastpodcast
CHAPTERS:
(00:00:00) About the Show
(00:00:22) About the Episode
(00:05:03) Introduction and Haize Labs Overview
(00:07:36) Universal Jailbreak Technique and Attacks
(00:13:47) Automated vs Manual Red Teaming
(00:17:15) Qualitative Assessment of Model Jailbreaking (Part 1)
(00:19:38) Sponsors: Oracle | Brave
(00:21:42) Qualitative Assessment of Model Jailbreaking (Part 2)
(00:26:21) Context-Specific Safety Considerations
(00:32:26) Model Capabilities and Safety Correlation (Part 1)
(00:36:22) Sponsors: Omneky | Squad
(00:37:48) Model Capabilities and Safety Correlation (Part 2)
(00:44:42) Model Behavior and Defense Mechanisms
(00:52:47) Challenges in Preventing Jailbreaks
(00:56:24) Safety, Capabilities, and Model Scale
(01:00:56) Model Classification and Preparedness
(01:04:40) Concluding Thoughts on o1 and Future Work
(01:05:54) Outro