
The next Open Mic session starts on Friday 23rd May 2025 at 9:30; you can join via this link. Speaker Evgenii Grigorev will unpack how Large Language Models (LLMs) such as GPT-4 and CodeLlama generate code, blending theory with real-world examples.

LLM-powered coding assistants
Abstract

Code-generating LLMs are not wizards — they’re sophisticated pattern matchers trained on terabytes of code. But how do they turn a prompt like “Sort this CSV by date and calculate weekly averages” into working Python? This session will demystify:

  • Core mechanics: Transformers, attention layers, and tokenization.
  • Training secrets: from GitHub scrapes to context-aware fine-tuning.
  • Why they fail: hallucinations, hidden biases, and the “copy-paste paradox”.
  • Worked examples: data-analysis snippets (Pandas, SQL) that illustrate the key concepts (see the sketch after this list).
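As a taste of those examples, here is a minimal sketch of the kind of Python an assistant might produce for the CSV prompt above. The file name sales.csv and the column names date and value are illustrative assumptions, not part of the talk:

```python
# Hypothetical LLM output for: "Sort this CSV by date and calculate
# weekly averages". File and column names are made up for this sketch.
import pandas as pd

# Parse dates on load so sorting and resampling operate on real timestamps
df = pd.read_csv("sales.csv", parse_dates=["date"])
df = df.sort_values("date")

# Bucket rows into calendar weeks and average the numeric column
weekly_avg = df.set_index("date")["value"].resample("W").mean()
print(weekly_avg)
```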

Outline

  1. Introduction to LLMs: Transformers, tokenization, and the “autocomplete on steroids” paradigm (a toy sketch follows this outline).

  2. Tools Deep Dive: GitHub Copilot, ChatGPT, CodeWhisperer, and open-source alternatives (StarCoder, Llama 3).

  3. Under the Hood: Training on GitHub data, context window limitations, and safety guardrails.

  4. Pros vs. Cons: 55% faster coding (GitHub study) vs. 40% of generated code containing vulnerabilities (Stanford research).
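To make item 1 concrete, here is a toy sketch of the “autocomplete on steroids” loop: generation is nothing more than repeated next-token prediction. The bigram table below stands in for a real transformer and is entirely made up for illustration:

```python
# Toy greedy decoding loop. A real LLM replaces the BIGRAMS lookup with
# a transformer that scores every vocabulary token given the full context.
BIGRAMS = {
    "def": "sort_csv",
    "sort_csv": "(",
    "(": "path",
    "path": ")",
    ")": ":",
}

def next_token(context):
    # Greedy choice: pick the single most likely continuation (real
    # assistants usually sample from a probability distribution instead)
    return BIGRAMS.get(context[-1])

tokens = ["def"]
while (tok := next_token(tokens)) is not None:
    tokens.append(tok)

print(" ".join(tokens))  # -> def sort_csv ( path ) :
```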
