Home /
technology Podcasts /
Into AI Safety /
HACKATHON: Evals November 2023 (2)

EPISODE · Feb 5, 2024 · 48 MIN

HACKATHON: Evals November 2023 (2)

from Into AI Safety · host Jacob Haimes

Join our hackathon group for the second episode in the Evals November 2023 Hackathon subseries. In this episode, we solidify our goals for the hackathon after some preliminary experimentation and ideation.Check out Stellaric's website, or follow them on Twitter.01:53 - Meeting starts05:05 - Pitch: extension of locked models23:23 - Pitch: retroactive holdout datasets34:04 - Preliminary results37:44 - Next steps42:55 - RecapLinks to all articles/papers which are mentioned throughout the episode can be found below, in order of their appearance.Evalugator libraryPassword Locked Model blogpostTruthfulQA: Measuring How Models Mimic Human FalsehoodsBLEU: a Method for Automatic Evaluation of Machine TranslationBoolQ: Exploring the Surprising Difficulty of Natural Yes/No QuestionsDetecting Pretraining Data from Large Language Models

NOW PLAYING

HACKATHON: Evals November 2023 (2)

0:00 48:39

1×

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Share this episode

Similar Episodes

EP 100 | Listener Q&A: Real Answers to Your Email Marketing and Copywriting Questions

Jun 11, 2025 ·21m

EP 99 | 5 Simple Ways to Add More Personality to Your Emails

Jun 4, 2025 ·9m

EP 98 | 4 Lies Online Business Owners Believe About Storytelling in Marketing

May 28, 2025 ·14m

EP 97 | Ditch Boring PDFs! Try These 2 High-Value Lead Magnets Instead

May 21, 2025 ·11m

EP 96 | Frustrated with Email Marketing? 8 Signs That Something Is Off

May 14, 2025 ·7m

EP 95 | Email Revival Project [3]: First Attempts at Resurrecting a Dead List

May 7, 2025 ·12m

Similar Podcasts

AI – IC之音竹科廣播 FM97.5 IC之音竹科廣播全球華人的心靈故鄉 Copy That Converts - Entrepreneurs, Copywriting, Launch, Email Marketing, Conversion Megan Wisdom | Copywriter, Email Metrics Mentor, Marketing Strategist Are you a female entrepreneur with an online business who’s struggling to grow and nurture your audience? Do you feel like you’re not making enough sales, despite your best efforts? Do you feel confused by all the marketing jargon and just wish you had a bossy business big sister to shoot it to you straight?Hey, friend. I know you didn’t get into business to get bogged down by writing, but let’s face it, the internet is still powered by WORDS. The good news? You can harness the power of those words to connect with your ideal clients and make more sales through the magic of copywriting.In each episode, we’ll dive deep into the world of copywriting and marketing, sharing insights and strategies that will help you craft compelling messages that resonate with your audience. From understanding your ideal customer to mastering the art of storytelling, we’ll cover it all.I’m Megan Wisdom, a firstborn, Enneagram 5 copywriter who loves to help other female entrepreneurs reach their business fuzz – Swamp Jacuzzi Biggie Boutte An intoxicating wild mind trip through the past, present, and future realms of rock n roll. A euphoric cocktail of spiritual awakening through fuzz and focal points. A new dawn taking the past into the future and the future towards comforts unknown. A yesterday's tomorrow. That time is now. So free your soul and expand your mind. The key to the gates is through this sonic elixir. Administer the medicine, fasten your seatbelts and hold on tight. We have a long journey ahead. But if you want to rock it, you know it's in the pocket. You need Electrophonic Tonic. It could save your soul. Ya dig? The Inner Circle UBS advisor podcasts Step into The Inner Circle, a dynamic and engaging podcast hosted by The Radius Group of UBS. Leveraging their extensive wealth management experience and a diverse network of industry experts, The Inner Circle explores the latest trends while sharing timeless wealth management techniques. Don’t miss out – elevate your financial knowledge by joining The Inner Circle today!

URL copied to clipboard!