Course Information
Course Overview
Essentials of Low-level Interpretation
Course overview
How programming languages work under the hood? What’s the difference between compiler and interpreter? What is a virtual machine, and JIT-compiler? And what about the difference between functional and imperative programming?
There are so many questions when it comes to implementing a programming language!
The problem with “compiler classes” in school is such classes are usually presented as some “hardcore rocket science” which is only for advanced engineers.
Moreover, classic compiler books start from the least significant topic, such as Lexical analysis, going straight down to the theoretical aspects of formal grammars. And by the time of implementing the first Tokenizer module, students simply lose an interest to the topic, not having a chance to actually start implementing a programing language itself. And all this is spread to a whole semester of messing with tokenizers and BNF grammars, without understanding an actual semantics of programming languages.
I believe we should be able to build and understand a full programming language semantics, end-to-end, in 4-6 hours — with a content going straight to the point, showed in live coding sessions as pair-programming and described in a comprehensible way.
In the Building a Virtual Machine class we focus specifically on runtime semantics, and build a stack-based VM for a programming language very similar to JavaScript or Python. Working closely with the bytecode level you will understand how lower-level interpretation works in production VMs today.
Implementing a programing language would also make your practical level in other programming languages more professional.
Prerequisites
There are two prerequisites for this class.
The Building a Virtual Machine course is a natural extension for the previous class — Building an Interpreter from scratch (aka Essentials of Interpretation), where we build also a full programming language, but at a higher, AST-level. Unless you already have understanding of how programming languages work at this level, i.e. what eval, a closure, a scope chain, environments, and other constructs are — you have to take the interpreters class as a prerequisite.
Also, going to lower (bytecode) level where production VMs live, we need to have basic C++ experience. This class however is not about C++, so we use just very basic (and transferrable) to other languages constructs.
Watch the introduction video for the details.
Who this class is for?
This class is for any curious engineer, who would like to gain skills of building complex systems (and building a programming language is an advanced engineering task!), and obtain a transferable knowledge for building such systems.
If you are interested specifically in compilers, bytecode interpreters, virtual machines, and source code transformation, then this class is also for you.
What is used for implementation?
Since lower-level VMs are about performance, they are usually implemented in a low-level language such as C or C++. This is exactly what we use as well, however mainly basic features from C++, not distracting to C++ specifics. The code should be easily convertible and portable to any other language, e.g. to Rust or even higher-level languages such as JavaScript — leveraging typed arrays to mimic memory concept. Using C++ also makes it easier implementing further JIT-compiler.
Note: we want our students to actually follow, understand and implement every detail of the VM themselves, instead of just copy-pasting from final solution. Even though the full source code for the language is presented in the video lectures, the code repository for the project contains /* Implement here */ assignments, which students have to solve.
What’s specific in this class?
The main features of these lectures are:
Concise and straight to the point. Each lecture is self-sufficient, concise, and describes information directly related to the topic, not distracting on unrelated materials or talks.
Animated presentation combined with live-editing notes. This makes understanding of the topics easier, and shows how the object structures are connected. Static slides simply don’t work for a complex content.
Live coding session end-to-end with assignments. The full source code, starting from scratch, and up to the very end is presented in the video lectures
What is in the course?
The course is divided into five parts, in total of 29 lectures, and many sub-topics in each lecture. Below is the table of contents and curriculum.
PART 1: VM BASIC OPERATIONS
In this part we describe compilation and interpretation pipeline, starting building our language. Topics of Stack and Register VMs, heap-allocated objects and compilation of the bytecode are discussed.
PART 2: CONTROL FLOW AND VARIABLES
In this part we implement control flow structures such as if expressions and while loops, talk about Global object and global variables, nested blocks and local variables, and also implement a disassembler.
PART 3.1: FUNCTIONS AND CALL STACK
In this part we start talking and implementing function abstraction and function calls. We describe concept of the Call stack, native and user-defined functions, and IILEs (Immediately-invoked lambda expressions).
PART 3.2: CLOSURES IMPLEMENTATION
In this part we focus on closures implementation, talking about scope and escape analysis, capturing free variables, and adding runtime support for closures.
PART 4: GARBAGE COLLECTION
This part is devoted to the automatic memory management known as Garbage collection. We discuss a tracing heap and implement Mark-Sweep garbage collector.
PART 5: OBJECT-ORIENTED PROGRAMMING
In the final part we add support for Object-oriented programming, implementing classes and instances. In addition we build the final VM executable.
Course Content
- 6 section(s)
- 29 lecture(s)
- Section 1 VM basic operations
- Section 2 Control flow and variables
- Section 3 Functions and Call stack
- Section 4 Closures implementation
- Section 5 Garbage Collection
- Section 6 Object-oriented programming
What You’ll Learn
- Virtual Machines implementations
- Stack-based vs. Register-based VMs
- Bytecode interpreter
- Compiler construction
- Call stack and Stack frames
- Low-level interpretation
- Object-oriented programming
- Functional programming
- Closures implementation
- Garbage Collection
- Mark-Sweep GC
- Understand how programming languages work under the hood
- Bytecode optimization
Skills covered in this course
Reviews
-
PPablo Medina
excelente tutorial, podrías proporcionar el código terminado para un mejor análisis, gracias.
-
TTiago Henrique Pereira
the classes are good, direct and clear but the syntax of the eva language confused me a little, if it were a syntax more similar to c++ or javascript it would make learning faster and more practical
-
SShubham Kumar
This course covered large number of topics. But while writing code it had missed few important edits in the video which made me confused since I was following it second to second. If you follow along the course and write the exact changes as shown in video you will get different errors a shown in video. Those errors were thrown because few changes were not shown by the instructor. This will discourage any learner.
-
LLuca Ottaviano
Great course, straight to the point. It really rubbed my nerdy side! Warning: this may take far more than the 4.5h of video lessons to complete, be prepared to spend 3-5 full days, depending on how much you want to digress. My biggest issue is that in my opinion the language he came up with and the bytecode specification are not very clear, so much so that at times I spent way too much time trying to run programs that probably had no hope of working. I'd love to have a reference implementation to test my Eva programs against! I spent so much time scanning video lessons to search for a hint of why my program was crashing. All in all I had fun and learnt many things, great!