EPISODE · Jun 6, 2024 · 51 MIN
Ep. 255 - June 5, 2024
from TechcraftingAI NLP · host Brad Edwards
ArXiv NLP research for Wednesday, June 05, 2024. 00:19: Improving In-Context Learning with Prediction Feedback for Sentiment Analysis 01:24: MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering Medical Knowledge 03:01: Text Injection for Neural Contextual Biasing 04:16: 4D ASR: Joint Beam Search Integrating CTC, Attention, Transducer, and Mask Predict Decoders 06:03: Adversarial Moment-Matching Distillation of Large Language Models 07:05: Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models 08:48: Readability-guided Idiom-aware Sentence Simplification (RISS) for Chinese 09:56: Evaluation of data inconsistency for multi-modal sentiment analysis 10:55: BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents 12:11: Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models 13:16: From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation 14:20: StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning 15:42: RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization 17:00: Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework 18:14: Cryptocurrency Frauds for Dummies: How ChatGPT introduces us to fraud? 19:48: FragRel: Exploiting Fragment-level Relations in the External Memory of Large Language Models 20:59: Space Decomposition for Sentence Embedding 22:00: Towards Real-world Scenario: Imbalanced New Intent Discovery 23:40: Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation 25:20: CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs 27:03: StatBot.Swiss: Bilingual Open Data Exploration in Natural Language 28:10: Missci: Reconstructing Fallacies in Misrepresented Science 29:43: ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction 30:47: Linking Named Entities in Diderot's \textit{Encyclop\'edie} to Wikidata 32:06: Error-preserving Automatic Speech Recognition of Young English Learners' Language 33:37: Document-level Claim Extraction and Decontextualisation for Fact-Checking 34:45: The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches 36:09: LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback 37:39: IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models 39:46: Automating Turkish Educational Quiz Generation Using Large Language Models 41:34: Cycles of Thought: Measuring LLM Confidence through Stable Explanations 42:57: Are language models rational? The case of coherence norms and belief revision 43:58: What is the Best Way for ChatGPT to Translate Poetry? 45:20: Using Synchronic Definitions and Semantic Relations to Classify Semantic Change Types 46:14: MODABS: Multi-Objective Learning for Dynamic Aspect-Based Summarization 47:09: BIPED: Pedagogically Informed Tutoring System for ESL Education 48:24: Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends 50:00: Wings: Learning Multimodal LLMs without Text-only Forgetting
What this episode covers
ArXiv NLP research for Wednesday, June 05, 2024. 00:19: Improving In-Context Learning with Prediction Feedback for Sentiment Analysis 01:24: MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering Medical Knowledge 03:01: Text Injection for Neural Contextual Biasing 04:16: 4D ASR: Joint Beam Search Integrating CTC, Attention, Transducer, and Mask Predict Decoders 06:03: Adversarial Moment-Matching Distillation of Large Language Models 07:05: Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models 08:48: Readability-guided Idiom-aware Sentence Simplification (RISS) for Chinese 09:56: Evaluation of data inconsistency for multi-modal sentiment analysis 10:55: BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents 12:11: Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models 13:16: From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation 14:20: StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning 15:42: RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization 17:00: Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework 18:14: Cryptocurrency Frauds for Dummies: How ChatGPT introduces us to fraud? 19:48: FragRel: Exploiting Fragment-level Relations in the External Memory of Large Language Models 20:59: Space Decomposition for Sentence Embedding 22:00: Towards Real-world Scenario: Imbalanced New Intent Discovery 23:40: Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation 25:20: CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs 27:03: StatBot.Swiss: Bilingual Open Data Exploration in Natural Language 28:10: Missci: Reconstructing Fallacies in Misrepresented Science 29:43: ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction 30:47: Linking Named Entities in Diderot's \textit{Encyclop\'edie} to Wikidata 32:06: Error-preserving Automatic Speech Recognition of Young English Learners' Language 33:37: Document-level Claim Extraction and Decontextualisation for Fact-Checking 34:45: The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches 36:09: LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback 37:39: IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models 39:46: Automating Turkish Educational Quiz Generation Using Large Language Models 41:34: Cycles of Thought: Measuring LLM Confidence through Stable Explanations 42:57: Are language models rational? The case of coherence norms and belief revision 43:58: What is the Best Way for ChatGPT to Translate Poetry? 45:20: Using Synchronic Definitions and Semantic Relations to Classify Semantic Change Types 46:14: MODABS: Multi-Objective Learning for Dynamic Aspect-Based Summarization 47:09: BIPED: Pedagogically Informed Tutoring System for ESL Education 48:24: Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends 50:00: Wings: Learning Multimodal LLMs without Text-only Forgetting
NOW PLAYING
Ep. 255 - June 5, 2024
No transcript for this episode yet
Similar Episodes
May 1, 2026 ·74m
Apr 22, 2026 ·7m
Feb 4, 2026 ·60m