How we evaluate AI models and LLMs for GitHub Copilot
We share some of the GitHub Copilot team’s experience evaluating AI models, with a focus on our offline evaluations—the tests we run before making any change to our production environment.
GitHub engineers and industry thought leaders offer tips, best practices, and practical explainers about various aspects of AI and ML, ranging from fundamental concepts to advanced techniques and real-world applications. For more detailed documentation and practical guides on GitHub’s own AI coding tool, GitHub Copilot, check out GitHub’s official documentation.
We share some of the GitHub Copilot team’s experience evaluating AI models, with a focus on our offline evaluations—the tests we run before making any change to our production environment.
Learn how to document and explain legacy code with GitHub Copilot with real-world examples.
How Copilot can generate unit tests, refactor code, create documentation, perform multi-file edits, and much more.
We released a new open source byte-pair tokenizer that is faster and more flexible than popular alternatives.
Learn how to generate unit tests with GitHub Copilot and get specific examples, a tutorial, and best practices.
Developers tell us how GitHub Copilot and other AI coding tools are transforming their work and changing how they spend their days.
GitHub Next launched the technical preview for GitHub Copilot Workspace in April 2024. Since then, we’ve been listening to the community, learning, and have some tips to share on how to get the most out of it!
Students used GitHub Copilot to decode ancient texts buried in Mount Vesuvius, achieving a groundbreaking historical breakthrough. This is their journey, the technology behind it, and the power of collaboration.
Learn how AI agents and agentic AI systems use generative AI models and large language models to autonomously perform tasks on behalf of end users.
To enhance your coding experience, AI tools should excel at saving you time with repetitive, administrative tasks, while providing accurate solutions to assist developers. Today, we’re spotlighting three updates designed to increase efficiency and boost developer creativity.
Learn how we’re experimenting with open source AI models to systematically incorporate customer feedback to supercharge our product roadmaps.
Unstructured data holds valuable information about codebases, organizational best practices, and customer feedback. Here are some ways you can leverage it with RAG, or retrieval-augmented generation.
Here’s how SAST tools combine generative AI with code scanning to help you deliver features faster and keep vulnerabilities out of code.
Here’s how retrieval-augmented generation, or RAG, uses a variety of data sources to keep AI models fresh with up-to-date information and organizational knowledge.
Learn how your organization can customize its LLM-based solution through retrieval augmented generation and fine-tuning.
Explore the capabilities and benefits of AI code generation, and how it can improve the developer experience for your enterprise.
Learn how we’re experimenting with generative AI models to extend GitHub Copilot across the developer lifecycle.
Here’s everything you need to know to build your first LLM app and problem spaces you can start exploring today.
Explore how LLMs generate text, why they sometimes hallucinate information, and the ethical implications surrounding their incredible capabilities.
Build what’s next on GitHub, the place for anyone from anywhere to build anything.