Blog
Filter by tags:AILLMPythonData EngineeringBackendThinking Toolsdeep learningmachine learningneural networkspytorchAI AgentsAPI IntegrationAutomationCRMClaudeComputer VisionCreativityDeep LearningEmbeddingsEvaluationJIRAMachine LearningProductivitySalesforceSecurityStorageWeb DevelopmentWritingtrainingAI ApplicationsAI EngineeringAI GatewayAPIAPI ManagementASGIAgent ArchitectureAgent SkillsAgentic SystemsAgentsAnalyticsAnthropicArchitectureAuthenticationAuthorizationBackend EngineeringBest PracticesCDCCNNChatbotCloud StorageCognitionDOCXData IngestionData ModelingData PipelinesData SystemsDatabaseDebuggingDesktop ApplicationDeveloper ToolsDocument ParsingEmailFastAPIFastHTMLFile ProcessingFile UploadFile UploadsFrameworkFundamentalsGeminiGmail APIGoogle DriveHTMXIndustrial AutomationInfrastructureJudgingKnowledge DistillationLLM JudgeLinear RegressionMCPManufacturingModel CompressionMulti-Step ReasoningNLPNestJSOpenTelemetryPDFPerformanceProduction SystemsPrompt EngineeringPruningPydanticPydantic AIQualityQuality ControlQuantizationRAGREST APIReliabilitySOQLSemantic SearchSpeechStatisticsSteel IndustryTask ManagementTechnical ArticlesText ProcessingTool-Based AIUXValidationVector Searchactivation layerloss functions
-
'LLM as a judge' for evaluation using Pydantic AI
A comprehensive guide to using LLMJudge in Pydantic AI for evaluating subjective qualities of LLM outputs based on rubrics.
31 Dec 2025by Arpit Pathak -
LLM-as-a-Judge: What It Is, How to Use It, and When Not To
A practical guide to using LLMs as evaluators: when to use them, when not to, and how to design effective rubrics for robust, reliable AI evaluation.
31 Dec 2025
Subscribe via RSS or enter your email to get notified of new posts directly in your inbox