Published onDecember 5, 2025A Practical Guide to RAG Evaluation With RAGAS Metrics and Confidence IntervalsRAGLarge-Language-ModelsMachine-LearningNatural-Language-ProcessingHow to model query quality, use bootstrapping, and report realistic RAG performance with RAGAS metrics and confidence intervals.
Published onSeptember 30, 2023Red Teaming Large Language ModelsRed-TeamingLarge-Language-ModelsExploring Recent Techniques to Uncover and Mitigate Undesirable Behaviors in Language Models
Published onSeptember 28, 2023Using Bayesian Optimization for Red Teaming Large Language ModelsBayesian-OptimizationLarge-Language-ModelsRed-TeamingUsing Bayesian Optimization for Red Teaming Large Language Models