Blog
Thoughts on AI, Software Engineering, and Technology
All Posts (7)
AI (3)
AI Agents (1)
AI in Coding (1)
Agentic-AI (2)
Architecture (2)
Cloud (2)
Cursor IDE (1)
Data Science (3)
Developer Productivity (1)
Evaluation Frameworks (1)
Linear Algebra (3)
Machine Learning (3)
Mathematics (1)
Matrix Equations (2)
Python (1)
Software Development (1)
Tech Trends (1)
2025
May
April
Making Sure AI Agents Play Nice: A Look at How We Evaluate Them
An in-depth analysis of evaluation frameworks for AI agents, covering conversational, autonomous, and multi-agent systems with practical insights and methodologies.
Read more