From Scratch Code RSS Feed

I'm embarrassed by how much code I cut from my test suite

Mon, 31 Mar 2025 00:00:00 GMT

My parser test suite was a house of cards I’ve never been prouder to see collapse. I found a pattern that worked, kept working, and then worked a little too well. Until rust-analyzer couldn't process my file anymore. Best practice says to refactor before your LSP craps out, but alas. Here's how I saved several thousand lines of test code in my Memphis parser.

Designing an expressive parser test suite

I’m going to break all rules of writing and tell this story Benjamin Button style.

We begin with this idyllic test case. Wouldn’t it be lovely to verify our AST in an expressive yet relaxed way? Yes, it is lovely.

#[test]
fn expression() {
    let input = "2 + 3 * (4 - 1)";
    let expected_ast = bin_op!(
        int!(2),
        Add,
        bin_op!(int!(3), Mul, bin_op!(int!(4), Sub, int!(1)))
    );

    assert_ast_eq!(input, expected_ast, Expr);

    let input = "2 // 3";
    let expected_ast = bin_op!(int!(2), IntegerDiv, int!(3));

    assert_ast_eq!(input, expected_ast, Expr);
}

This test wasn’t all sunflowers and rainbows. This 16-line fact-checker used to come in at a bloated 38 lines.

Have a peep yourself.

#[test]
fn expression() {
    let input = "2 + 3 * (4 - 1)";
    let context = init(input);

    let expected_ast = Expr::BinaryOperation {
        left: Box::new(Expr::Integer(2)),
        op: BinOp::Add,
        right: Box::new(Expr::BinaryOperation {
            left: Box::new(Expr::Integer(3)),
            op: BinOp::Mul,
            right: Box::new(Expr::BinaryOperation {
                left: Box::new(Expr::Integer(4)),
                op: BinOp::Sub,
                right: Box::new(Expr::Integer(1)),
            }),
        }),
    };

    match context.parse_oneshot::<Expr>() {
        Err(e) => panic!("Parser error: {:?}", e),
        Ok(ast) => assert_eq!(ast, expected_ast),
    }

    let input = "2 // 3";
    let context = init(input);

    let expected_ast = Expr::BinaryOperation {
        left: Box::new(Expr::Integer(2)),
        op: BinOp::IntegerDiv,
        right: Box::new(Expr::Integer(3)),
    };

    match context.parse_oneshot::<Expr>() {
        Err(e) => panic!("Parser error: {:?}", e),
        Ok(ast) => assert_eq!(ast, expected_ast),
    }
}

The tool I used to reduce my boilerplate was declarative macros, Rust’s way of generating Rust code at compile time.

By the end of my cleanup trance, I improved these areas:

expressing Python types (int, str, bool, list, set, tuple)
expressing operations (binary, unary, and logical ops)
wrapping the actual entrypoint to parse the input
wrapping error handling for the common case

The result? Shorter tests and clearer intentions.

Expressing Python types

I can now write int!(3) instead of Expr::Integer(3). Blah blah blah so what.

This one doesn’t save a whole lot, but let’s look at two more.

I can now write list![int!(1), int!(2), int!(3)] and set![int!(1), int!(2), int!(3)]. Glancing at the implementation for those macros, we see another key.

macro_rules! list {
    ($($expr:expr),* $(,)?) => {
        Expr::List(vec![
            $($expr),*
        ])
    };
}

macro_rules! set {
    ($($expr:expr),* $(,)?) => {
        Expr::Set(HashSet::from([
            $($expr),*
        ]))
    };
}

My parser tests no longer care that an Expr::List accepts a Vec, while an Expr::Set accepts a HashSet. One could argue I don’t need the HashSet at all because this is just the AST, not the evaluation stage of the interpreter. In that case, I could change the underlying representation by updating the macro.
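To make that last point concrete, here is a minimal sketch of what the swap could look like. The Expr enum below is a stand-in, not the real Memphis AST, and the Vec-backed Set variant is purely an assumption for illustration. The test-facing syntax stays the same; only the macro body changes.

```rust
// Stand-in AST for illustration only; the real Memphis `Expr` has many more variants.
#[derive(Debug, PartialEq)]
enum Expr {
    Integer(i64),
    // Hypothetical: a set literal stored as an ordered Vec instead of a HashSet.
    Set(Vec<Expr>),
}

macro_rules! int {
    ($val:expr) => {
        Expr::Integer($val)
    };
}

// Same call syntax as before, but the body now builds a Vec.
macro_rules! set {
    ($($expr:expr),* $(,)?) => {
        Expr::Set(vec![$($expr),*])
    };
}

// A test would still read exactly like it did with the HashSet version.
fn example_set() -> Expr {
    set![int!(1), int!(2), int!(3)]
}
```

Every test that says set![...] would keep compiling, which is the whole appeal of the indirection.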

Expressing operations

This one starts to get really fun. I can now write bin_op!(var!("a"), BitwiseAnd, var!("b")), thanks to the following macro.

macro_rules! bin_op {
    ($left:expr, $op:ident, $right:expr) => {
        Expr::BinaryOperation {
            left: Box::new($left),
            op: BinOp::$op,
            right: Box::new($right),
        }
    };
}

Not only does this macro hide the two Box initializations (necessary in a recursive enum), it also allows us to write BitwiseAnd rather than BinOp::BitwiseAnd, all without sacrificing any type checking.
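For a self-contained look at the equivalence, here is a small sketch. The Expr and BinOp enums are trimmed stand-ins, and the int! definition is my assumption of its shape based on the int!(3) to Expr::Integer(3) mapping described earlier.

```rust
// Trimmed stand-ins for the real enums.
#[derive(Debug, PartialEq)]
enum BinOp {
    Add,
    Mul,
}

#[derive(Debug, PartialEq)]
enum Expr {
    Integer(i64),
    BinaryOperation {
        left: Box<Expr>,
        op: BinOp,
        right: Box<Expr>,
    },
}

macro_rules! int {
    ($val:expr) => {
        Expr::Integer($val)
    };
}

macro_rules! bin_op {
    ($left:expr, $op:ident, $right:expr) => {
        Expr::BinaryOperation {
            left: Box::new($left),
            op: BinOp::$op,
            right: Box::new($right),
        }
    };
}

// The macro version of `2 + 3 * 4`.
fn with_macros() -> Expr {
    bin_op!(int!(2), Add, bin_op!(int!(3), Mul, int!(4)))
}

// The same tree written out by hand.
fn by_hand() -> Expr {
    Expr::BinaryOperation {
        left: Box::new(Expr::Integer(2)),
        op: BinOp::Add,
        right: Box::new(Expr::BinaryOperation {
            left: Box::new(Expr::Integer(3)),
            op: BinOp::Mul,
            right: Box::new(Expr::Integer(4)),
        }),
    }
}
```

Because $op is pasted after BinOp::, a typo such as bin_op!(int!(1), Plus, int!(2)) is still a compile error: BinOp has no variant named Plus.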

Wrapping the parser entrypoint

Previously, I had to write this, which isn’t awful, but is also awful.

let context = init("2 + 3");

match context.parse_oneshot::<Expr>() {
   ...
}

We’re working with a MemphisContext object here, another bloated structure I use to orchestrate the whole evaluation flow. The problem is, this is an evolving interface. I learned the hard way that without a level of indirection in my tests, I’d have to tweak this pattern constantly. Across hundreds of tests, that is obnoxious.

The new approach uses a straightforward parse! macro.

let ast = parse!($input, Statement);
assert_stmt_eq!(ast, $expected);

Wrapping the happy path error handling

As Yogi Berra famously said, 90% of unit tests are one-half mental.

I applied this philosophy when designing parse! to handle the happy path: the roughly 90% of my parser tests where I expect the Python input to parse successfully. Since these are tests and nothing matters, we can fail loudly on any unexpected errors and return the AST quietly otherwise.

macro_rules! parse {
    ($input:expr, $pattern:ident) => {
        match init($input).parse_oneshot::<$pattern>() {
            Err(e) => panic!("Parser error: {:?}", e),
            Ok(ast) => ast,
        }
    };
}

And for those 10% of tests where we expect a parse error? This will do just fine.

macro_rules! expect_error {
    ($input:expr, $pattern:ident) => {
        match init($input).parse_oneshot::<$pattern>() {
            Ok(_) => panic!("Expected a ParserError!"),
            Err(e) => e,
        }
    };
}

This is what it now looks like to confirm an improperly structured dict. While I love match statements, I love even more no longer having to write them.

let input = "{ 2, **second }";
let e = expect_error!(input, Expr);
assert_eq!(e, ParserError::SyntaxError);
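To see how these macros compose, here is a runnable sketch built around a toy parser. Everything except the two macro bodies is an assumption: the Parse trait, the Context struct, and this init are stand-ins for MemphisContext, and the assert_ast_eq! definition is my guess at how such a wrapper could be layered on top of parse!.

```rust
#[derive(Debug, PartialEq)]
enum Expr {
    Integer(i64),
}

#[derive(Debug, PartialEq)]
enum ParserError {
    SyntaxError,
}

// Hypothetical trait so the toy context can be generic over the parse target.
trait Parse: Sized {
    fn parse(input: &str) -> Result<Self, ParserError>;
}

// Toy parser: only understands a single integer literal.
impl Parse for Expr {
    fn parse(input: &str) -> Result<Self, ParserError> {
        input
            .trim()
            .parse::<i64>()
            .map(Expr::Integer)
            .map_err(|_| ParserError::SyntaxError)
    }
}

// Stand-in for MemphisContext.
struct Context {
    input: String,
}

fn init(input: &str) -> Context {
    Context { input: input.to_string() }
}

impl Context {
    fn parse_oneshot<T: Parse>(&self) -> Result<T, ParserError> {
        T::parse(&self.input)
    }
}

macro_rules! parse {
    ($input:expr, $pattern:ident) => {
        match init($input).parse_oneshot::<$pattern>() {
            Err(e) => panic!("Parser error: {:?}", e),
            Ok(ast) => ast,
        }
    };
}

macro_rules! expect_error {
    ($input:expr, $pattern:ident) => {
        match init($input).parse_oneshot::<$pattern>() {
            Ok(_) => panic!("Expected a ParserError!"),
            Err(e) => e,
        }
    };
}

// Assumed shape: parse the input, then compare against the expected AST.
macro_rules! assert_ast_eq {
    ($input:expr, $expected:expr, $pattern:ident) => {
        assert_eq!(parse!($input, $pattern), $expected)
    };
}

fn demo_ok() -> Expr {
    parse!("42", Expr)
}

fn demo_err() -> ParserError {
    expect_error!("4 2", Expr)
}
```

If the context interface changes again, only init and the two base macros would need to follow; the tests themselves stay put.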

The End

I’m embarrassed I didn’t do these steps sooner. Please learn from my mistakes and treat your tests the way you wish to be treated.

I left corporate and still do roadmaps + a Memphis update

Mon, 03 Mar 2025 00:00:00 GMT

Happy March!

This month I am putting a bow on my Q1 roadmap and continuing to unify the two Memphis execution engines.

I chose an update post because I want to “build in public.” What follows is what has been on my mind! I don’t write advice posts (Why you shouldn’t use Vec) or random technical overviews (Vectors vs. Slices in Rust: A Complete Guide) because I am unable to do either with a straight face. If I could, I’d probably still work in a large organization where those kinds of ambitious but safe norms are politely celebrated. Someone else will write those pieces and I support them in the same way I support someone running a marathon; I’m not against it, but that doesn’t mean I want to.

It appears this “build in public” includes me breaking the fourth wall on my writing process! Can you tell I struggle in groups? If you’re still here, let’s continue.

Q1 Roadmap

I began Q1 with a list of 6 items I wanted to achieve for From Scratch. As someone who once flew to Chicago to write a list of personal goals on a public library whiteboard with a close friend, I’m no stranger to making goals. What felt new was doing them with that uniquely American fuel: a profit motive.

The Original 6 (the little-known prequel to The Magnificent 7) were:

Launch website testimonials (thanks Jakub!)
Improve SEO for fromscratchcode.com
Launch lead magnet
Run a small trial with paid ads
Write chapters 3-5 of my novella
File taxes and pay Q1 estimated taxes

I would later add two more:

Add CTA to and modernize my personal site
Create a runbook for follow-ups

I’m still working on Chapter 5 (Chapters 1-4 are on From Scratch Press) and the follow-up runbook, but everything else is DONE. Who knew you could accomplish things without Jira for Enterprise.

These records have unintentionally produced a fairly detailed account of the playbook I’m running to build my online business. Which I’m excited to share with you in case you too are interested in how to earn small amounts of money with only an internet connection.

I feel pride looking at this list because it makes my efforts feel less random. I can reassure myself: “Sure, I’m still growing, but here’s my strategy.” I can throw (and have thrown!) this list into ChatGPT and ask what foundational pieces I am missing. And each time it says “Share your work on social media,” I close the tab and search for From Scratch on Google instead.

Multiple Execution Engines

Memphis has supported two execution engines for about a year now. But barely.

The treewalk interpreter is farthest along and, if you wanted to try to run real code, it is what you should use. I’ve been strengthening the bytecode VM’s foundation and it’s coming along, but slowly. Because I have no deadlines, I’m being deliberate about defining Memphis capabilities versus treewalk capabilities versus bytecode VM capabilities. Did I mention I have no deadlines?

The unification has proceeded in the following broad strokes.

Common Entrypoint

Initially, you could pick an engine like this.

# Treewalk is default FOR NOW
memphis example.py

# VM selected using an environment variable
MEMPHIS_ENGINE=vm memphis example.py

Which works fine! But after kicking off the runtime, there was no coming back together. Or reunification.

This remains the interface to select a non-default engine, but I’m gradually unifying more of the code behind the scenes. I’ve also floated the idea of using a Rust feature flag instead of an environment variable to produce a smaller binary.
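A minimal sketch of how environment-variable selection might look on the Rust side. The Engine enum and both function names are hypothetical, and falling back to treewalk for any unrecognized value is an assumption, not documented Memphis behavior.

```rust
use std::env;

#[derive(Debug, Clone, Copy, PartialEq)]
enum Engine {
    Treewalk,
    Vm,
}

// Pure function so the decision is easy to test without touching the environment.
fn select_engine(value: Option<&str>) -> Engine {
    match value {
        Some("vm") => Engine::Vm,
        // Assumption: anything else (including an unset variable) means the default engine.
        _ => Engine::Treewalk,
    }
}

// Thin wrapper that reads MEMPHIS_ENGINE from the process environment.
fn engine_from_env() -> Engine {
    select_engine(env::var("MEMPHIS_ENGINE").ok().as_deref())
}
```

With a Cargo feature flag instead, the choice would move to compile time (for example, gating the VM module behind a cfg attribute), which is where the smaller binary would come from.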

Common Return Type

Because I respect those who came before me, I wanted to treat Python runtime errors as first-class. Meaning I wouldn’t trap them below deck after hitting an iceberg.

Instead of separate error types for treewalk and VM errors, I would combine them. This would also be a chance to turn this dumping ground I aspirationally called InterpreterError into a type which actually represented possible Python runtime errors.

// The treewalk version was first so it is known as "Interpreter" here
pub enum InterpreterError {
    Exception(DebugCallStack),
    TypeError(Option<String>, DebugCallStack),
    KeyError(String, DebugCallStack),
    ValueError(String, DebugCallStack),
    NameError(String, DebugCallStack),
    AttributeError(String, String, DebugCallStack),
    FunctionNotFound(String, DebugCallStack),
    MethodNotFound(String, DebugCallStack),
    ClassNotFound(String, DebugCallStack),
    ModuleNotFound(String, DebugCallStack),
    DivisionByZero(String, DebugCallStack),
    ExpectedVariable(DebugCallStack),
    ExpectedString(DebugCallStack),
    ExpectedInteger(DebugCallStack),
    ExpectedList(DebugCallStack),
    ExpectedTuple(DebugCallStack),
    ExpectedRange(DebugCallStack),
    ExpectedSet(DebugCallStack),
    ExpectedDict(DebugCallStack),
    ExpectedFloatingPoint(DebugCallStack),
    ExpectedBoolean(DebugCallStack),
    ExpectedObject(DebugCallStack),
    ExpectedClass(DebugCallStack),
    ExpectedFunction(DebugCallStack),
    ExpectedIterable(DebugCallStack),
    ExpectedCoroutine(DebugCallStack),
    WrongNumberOfArguments(usize, usize, DebugCallStack),
    StopIteration(DebugCallStack),
    AssertionError(DebugCallStack),
    MissingContextManagerProtocol(DebugCallStack),
    RuntimeError,
    EncounteredReturn(ExprResult),
    EncounteredRaise,
    EncounteredAwait,
    EncounteredSleep,
    EncounteredBreak,
    EncounteredContinue,
}

pub enum VmError {
    StackUnderflow,
    StackOverflow,
    NameError(String),
    RuntimeError,
}

I landed on this streamlined structure which could now be used on both engines. All my previous ExpectedString, ExpectedInteger, etc., variants are now just a TypeError. This mirrors CPython, where an optional String parameter provides more detail.

pub struct ExecutionError {
    pub debug_call_stack: DebugCallStack,
    pub execution_error_kind: ExecutionErrorKind,
}

pub enum ExecutionErrorKind {
    RuntimeError,
    ImportError(String),
    TypeError(Option<String>),
    KeyError(String),
    ValueError(String),
    NameError(String),
    AttributeError(String, String),
    DivisionByZero(String),
    StopIteration,
    AssertionError,
    MissingContextManagerProtocol,
}

With this unified type, I was able to test for an expected NameError in a crosscheck test. Which reminds me, I should really write more about crosscheck, my testing framework for both engines. I have a fun proc macro in the works there.
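Here is a small sketch of what checking for one of these errors could look like. The struct and enum are trimmed copies of the ones above; DebugCallStack is a stand-in, and load_name and expected_string are hypothetical helpers (including the detail message), not functions from Memphis or crosscheck.

```rust
// Stand-in for the real DebugCallStack; here it is just a list of frame names.
#[derive(Debug, Clone, Default, PartialEq)]
pub struct DebugCallStack {
    pub frames: Vec<String>,
}

// A trimmed copy of the error kinds.
#[derive(Debug, PartialEq)]
pub enum ExecutionErrorKind {
    TypeError(Option<String>),
    NameError(String),
}

#[derive(Debug, PartialEq)]
pub struct ExecutionError {
    pub debug_call_stack: DebugCallStack,
    pub execution_error_kind: ExecutionErrorKind,
}

// Hypothetical: what used to be ExpectedString becomes a TypeError with a detail message.
pub fn expected_string(stack: &DebugCallStack) -> ExecutionError {
    ExecutionError {
        debug_call_stack: stack.clone(),
        execution_error_kind: ExecutionErrorKind::TypeError(Some("Expected a string".to_string())),
    }
}

// Hypothetical name lookup that either engine could call.
pub fn load_name(
    name: &str,
    globals: &[(&str, i64)],
    stack: &DebugCallStack,
) -> Result<i64, ExecutionError> {
    globals
        .iter()
        .find(|(known, _)| *known == name)
        .map(|(_, value)| *value)
        .ok_or_else(|| ExecutionError {
            debug_call_stack: stack.clone(),
            execution_error_kind: ExecutionErrorKind::NameError(name.to_string()),
        })
}
```

Since the error kind no longer carries the call stack, a test can compare kinds directly without constructing a matching stack.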

Now that both engines returned an ExecutionError, I needed to build an interface to push new stack frames to the DebugCallStack. These would be displayed to the user whenever a Python runtime error occurs in their user code.

Common Debug Stack Trace

I’m still actively working on unifying stack traces, but I’m excited because it is me saying “Memphis supports a debug stack trace, regardless of what execution engine you choose,” rather than just “both engines have a stack trace.” Do you see the difference? It’s a pLaTfOrM. Or maybe I mean fRaMeWoRk? It’s something.

I cleaned up my DebugStackTrace and DebugCallStack structs and moved them into a domain module to indicate they represent a Python stack trace, but should NOT be used as a source of runtime info for an engine.

One challenge is giving each engine access to the right-sized shared state object. This would be how each engine could register new stack frames; think something like state.push_stack_frame(function.to_stack_frame()) when entering a new function context. I’m attempting to balance platform capabilities (MemphisState) against freedom of implementation within each engine (TreewalkState and VmState).
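A rough sketch of that push_stack_frame idea, under heavy assumptions: the fields on StackFrame, the Function type, and the pop_stack_frame counterpart are all invented for illustration, and the real MemphisState surely holds more than a call stack.

```rust
#[derive(Debug, Clone, PartialEq)]
struct StackFrame {
    function_name: String,
    file_path: String,
    line_number: usize,
}

#[derive(Debug, Clone, Default, PartialEq)]
struct DebugCallStack {
    frames: Vec<StackFrame>,
}

// Hypothetical shared state that both engines could hold a handle to.
#[derive(Default)]
struct MemphisState {
    call_stack: DebugCallStack,
}

impl MemphisState {
    fn push_stack_frame(&mut self, frame: StackFrame) {
        self.call_stack.frames.push(frame);
    }

    fn pop_stack_frame(&mut self) -> Option<StackFrame> {
        self.call_stack.frames.pop()
    }

    // Snapshot to attach to an error when something goes wrong.
    fn debug_call_stack(&self) -> DebugCallStack {
        self.call_stack.clone()
    }
}

// Hypothetical function object carrying the metadata a frame needs.
struct Function {
    name: String,
    file_path: String,
    line_number: usize,
}

impl Function {
    fn to_stack_frame(&self) -> StackFrame {
        StackFrame {
            function_name: self.name.clone(),
            file_path: self.file_path.clone(),
            line_number: self.line_number,
        }
    }
}

// Enter a function, observe the stack depth, then leave again.
fn call_depth_during(state: &mut MemphisState, function: &Function) -> usize {
    state.push_stack_frame(function.to_stack_frame());
    let depth = state.debug_call_stack().frames.len();
    state.pop_stack_frame();
    depth
}
```

The engine-specific states (TreewalkState, VmState) would then only need a way to reach these two methods, not the stack itself.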

The other challenge of implementing stack traces is it forces you to keep around your metadata for use at the right time. Metadata like file paths and line numbers aren’t necessary for actually running Python code, but they’re essential for debugging. When a statement is parsed and immediately evaluated (treewalk) or immediately compiled (bytecode VM, though this has a bug right now), we record the line number of the start of the current function and increment it each time we see a new line. This parser<>runtime communication is only necessary for stack traces (for me. so far.).

This work stream has been a crash course in applying the Single-Responsibility Principle a few decades after I first learned of it. It’s easy to understand the definition (“of course each function/struct/class only does one thing!”). But applying it? That’s harder (“we’ve got access to state here so I’ll put it there!”). The result is my code is GRADUALLY shifting from a medium number of medium-sized functions to a whole lot of tiny ones. I’ve always been an advocate for adding a second entrypoint, either through unit tests or a REPL, but Memphis has forced me to expand that thinking to an entirely different level.

The End

I’m putting the finishing touches on my Q2 roadmap, which should take my business to the max! Do people still say that?

I recently finished The Pathless Path by Paul Millerd and his message of finding the work each of us wants to do indefinitely resonated with me. With my Memphis engine work and my roadmap work, I believe I’m closer than ever to finding that. I’d also love to be a technical mentor to anyone reading this. Because money.

Hope you are well!

Build a software career with meaning: a playbook

Mon, 17 Feb 2025 00:00:00 GMT

Even with 20 fingers and toes, I barely have enough digits to count how many times during my 9-5 career I thought, “Wow, I feel frustrated/stuck/alone/hungry, maybe I should look for a new job.” A new job holds the allure of allowing you to let go of all your negative emotions (especially hunger) and start over.

There are plenty of reasons to switch jobs, particularly when advocating for fairer compensation or a less-toxic environment. But if you're like me and looking to build a meaningful software career, where you can use your brain to its fullest and maybe do a bit of good, job hopping is more like a salve on a wound than a complete fix.

With this in mind, I’m excited to announce my new email course: Build a Software Career with Meaning. While under development, I called this “How I went from ‘Hello World!’ to ‘How can I help?’ in just 19 years,” which is clearly a cheeky title but captures the work-in-progress feel of my career.

Did I mention it’s free? Over the course of 5 (business) days, you’ll receive a brief email with a piece of wisdom, along with an actionable thought experiment you can test on your career.

I also want to be transparent: in email marketing speak, this is called a “lead magnet.” I learned this term from reading (does YouTube still exist?), so I can only assume this is referring to the element from the periodic table with the symbol Pb (you know, the one pronounced ‘led’). And here I thought lead wasn't magnetic!

I'm offering the course for free because I want to share how I think about things. At the end of 5 (business) days, perhaps you’ll feel like you know a bit more about my values and want to work with me. If you don’t, that’s more than okay! You are welcome to stay on the list indefinitely, and the option to unsubscribe will always be at the bottom. There are no tricks here, just a marketing playbook I’ve been learning in between implementing my Markdown blog and nested functions in Python bytecode.

If you’ve ever felt stuck or disillusioned in your software career, I hope this course gives you a new perspective. If nothing else, it’ll be five (business) days of me popping into your email client with career advice just a tad more nuanced than “Quit your job!”

P.S. If you’re interested in reading the story about how my 9-5 career crashed-and-burned—and how I built something better from the wreckage—I have the complete account over on From Scratch Press. I’d love to hear if any of my experiences and catatonic thought loops (the kind where you forget to eat) mirror your own!

Building a Markdown blog with links optimized for Gatsby

Mon, 27 Jan 2025 00:00:00 GMT

While I was hoping this website would build itself, it ended up taking a good amount of fine-tuning! I wanted to share how I built this blog and a few of the challenges I encountered.

Background

My setup for this website is:

domain and DNS managed on GoDaddy
code lives in a private GitHub repository (should I make this public?)
static site written in React using Gatsby
site deployed using the free tier on Netlify

While Dev.to and Hashnode have been helpful to get me flowing on my technical writing, I really wanted a minimal blog over which I exercised full control. I wanted to be able to style blockquotes the way I liked. I wanted the blog to reflect the From Scratch ethos that less is better.

The final straw: when I realized I was missing out on potential SEO improvements by not publishing my writing directly to fromscratchcode.com.

Thus this silly blog that I’m moderately proud of was born!

List of Requirements

My requirements for the new blog were:

Posts shall be written in Markdown and be portable, meaning require no changes to be thrown into other editors (Dev.to, Hashnode, etc). As a bonus, I currently write these in Notion because they copy out already in Markdown.
Posts shall support syntax highlighting for code blocks that actually look good. (This was a PAIN on my email provider. Too bad this post isn’t about picking an email provider!)
Internal links within posts shall be optimized for navigation and SEO.

Gatsby is modern, flexible, and I’d barely used it before, which gave me the confidence I could pull this off. I wrote this post because these requirements got more difficult as I went and I hope this can be a resource for someone else. Here’s what I learned!

Supporting Markdown Posts

My motivation for writing in Markdown will not surprise you: I wanted to spend more time writing and less time formatting. To move a piece (including code!) between platforms and have it just work. Markdown checks all of those boxes.

Gatsby’s ecosystem is rich with markdown support. Here’s how I leveraged it!

I used the gatsby-transformer-remark plugin to read the Markdown files. Next, I used the createPages API in gatsby-node.js to register a new page for each slug (this is the part of a URL that identifies a page and, for some reason, is named after a garden mollusk).

Then, I used Gatsby’s createNodeField to attach additional metadata to the Markdown node. This ensures this metadata is available to components via GraphQL queries. I also used the reading-time library to calculate the reading time for each block of Markdown to give each post some fun deets and make this look like a real blog.

Here are these pieces assembled together in my gatsby-node.js file:

const path = require("path")
const readingTime = require("reading-time")

const BLOG_QUERY = `
    {
      allMarkdownRemark(
        filter: { fileAbsolutePath: { regex: "/blog/" } }
        sort: { frontmatter: { date: DESC } }
      ) {
        edges {
          node {
            frontmatter {
              slug
            }
          }
        }
      }
    }
  `

// This function runs a GraphQL query and creates pages at `${base_url}/${node.frontmatter.slug}`
// for each using the provided template.
// NOTE: each markdown file must provide its own slug in the frontmatter.
// TODO: could we provide a default in the case the frontmatter is not specified?
const createMarkdownPages = async (
  graphql,
  createPage,
  query,
  base_url,
  template
) => {
  const markdownFiles = await graphql(query)

  markdownFiles.data.allMarkdownRemark.edges.forEach(({ node }) => {
    const slug = `${base_url}/${node.frontmatter.slug}`
    createPage({
      path: slug,
      component: template,
      // The context is needed for the $slug param lookup in the query inside the
      // respective template
      context: {
        slug,
      },
    })
  })
}

exports.createPages = async ({ graphql, actions }) => {
  const { createPage } = actions
  const blogPostTemplate = path.resolve(`src/templates/blogPostTemplate.js`)

  await createMarkdownPages(
    graphql,
    createPage,
    BLOG_QUERY,
    "/blog",
    blogPostTemplate
  )
}

exports.onCreateNode = ({ node, actions, getNode }) => {
  const { createNodeField } = actions
  if (node.internal.type === "MarkdownRemark") {
    let slug = node.frontmatter.slug

    // Determine the base path (e.g., 'policies' or 'blog')
    // This is the name from the gatsby-source-filesystem config in gatsby-config.js
    const sourceInstanceName = getNode(node.parent).sourceInstanceName

    // Add the slug field to the node, which will become queryable from GraphQL. This is safer
    // than relying on frontmatter within the component pages because we have applied all the
    // necessary transformations here before we write to the field.
    createNodeField({
      node,
      name: "slug",
      value: `/${sourceInstanceName}/${slug}`,
    })

    // Calculate the reading time of the markdown content and add it to the GraphQL
    const stats = readingTime(node.rawMarkdownBody)
    createNodeField({
      node,
      name: "readingTime",
      value: stats.text, // Example: "3 min read"
    })
  }
}

I trimmed for brevity (HA!), but I process my Terms of Use and Privacy Policy using the same procedure. While those would not need syntax highlighting, requirement #3 about optimized internal links would still apply.

Next, I used Gatsby’s GraphQL to query for the post content based on the slug of the rendered page.

This is my first pass at blogPostTemplate.js. I have stripped out a few pieces to focus on the GraphQL query.

import React from "react"
import { graphql } from "gatsby"

const BlogPost = ({ data }) => {
  const post = data.markdownRemark

  return (
    <Layout>
      <article>
        <h1>{post.frontmatter.title}</h1>
        <div dangerouslySetInnerHTML={{ __html: post.html }} />
      </article>
    </Layout>
  )
}
export const query = graphql`
  query ($slug: String!) {
    markdownRemark(fields: { slug: { eq: $slug } }) {
      frontmatter {
        title
        date(formatString: "MMM DD, YYYY")
      }
      fields {
        readingTime
      }
      html
    }
  }
`

export default BlogPost

Consider the following piece of frontmatter, the metadata at the top of a Markdown file:

---
title: "Introducing: From Scratch Code"
date: "2024-11-04"
slug: "introducing-from-scratch-code"
---

At this point, we have created and populated a page at /blog/introducing-from-scratch-code/. Awesome!

Enabling Syntax Highlighting

Syntax highlighting ended up taking two tries, both using PrismJS.

First Attempt: `gatsby-remark-prismjs`

My first approach used the gatsby-remark-prismjs plugin (yes, this is a plugin [prismjs] to a plugin [remark] to a framework [gatsby]) during the Gatsby build pipeline. Here was the addition to my gatsby-config.js file.

{
  resolve: `gatsby-transformer-remark`,
  options: {
    plugins: [
      {
        resolve: `gatsby-remark-prismjs`,
        options: {
          classPrefix: "language-", // Set a class prefix
          inlineCodeMarker: null, // Marker for inline code
          // Do not use prism to highlight inline code
          noInlineHighlight: true,
        },
      },
    ],
  }
}

This approach does these steps behind the scenes:

Read markdown
Convert it to HTML
Apply syntax highlighting via CSS classes
Let React render our HTML which we fetched by querying GraphQL

This worked out of the box!

To customize the theme, I added this to my gatsby-browser.js.

// Prism.js syntax highlighting
import "prismjs/themes/prism-tomorrow.css"

I had some trouble styling line numbers and the Copy-to-Clipboard button. There seemed to be some complexity around PrismJS plugins inside a static site processing pipeline such as Remark in Gatsby, so I punted on those.

This approach worked great until it didn’t, which brings me to requirement #3.

Optimizing Internal Links

When I say “internal link”, I mean a link on this website, such as /blog.

I had two motivations for optimizing these:

This is mostly me being particular, but in order to truly make my Markdown portable, I wanted to be able to write an absolute link (such as https://fromscratchcode.com/mentorship) in Markdown and have it be treated as the internal link /mentorship.
This is the biggie: Gatsby applies a magic touch using its Link component, which consists of several things.
1. Under the hood, Gatsby prefetches any Link destination when the page loads.
2. When clicked, it uses @reach/router to navigate to it using ultra-smooth client-side navigation.
3. This is all while appearing to be an <a> element in the page source, which keeps these links *chef's kiss* perfect for SEO because they remain visible to crawlers such as Google’s.
4. In addition, our React state persists across clicks. Without this optimization, the state of the dark mode toggle would persist when using the top nav, but not when clicking a link discovered in a blog post. That is not ideal!

Second Attempt: Switching to Rehype

The challenge here is turning [mentorship](/mentorship) into <Link to="/mentorship">mentorship</Link> and having it still be rendered as React.

There is a strong ecosystem of JS libraries to help with this, but I needed to switch from using PrismJS in a Remark plugin to a Rehype plugin. That sentence would have been word salad to me a few ~~hours~~ weeks ago, so please bear with me. Gatsby’s Remark pipeline converts Markdown to HTML before React sees it, so internal links get set before React can modify them. By using Rehype, we get a hook where we can dynamically swap <a> for Link. Rehype also supports integrating PrismJS syntax highlighting. Here’s how I pulled it off!

The function below does this whole shebang in several steps:

Parse our raw markdown content into an AST (Abstract Syntax Tree).
Apply a plugin we would need to write to detect and convert external links to internal links.
Convert the markdown AST (used by Remark) into an HTML AST (used by Rehype).
Apply PrismJS syntax highlighting using a Rehype plugin.
Render the Rehype AST as React, converting any <a> elements into CustomLink components along the way. (CustomLink is a wrapper I created around Gatsby’s Link so that it works for internal or external links.)

// Render markdown to React
// This pipeline processes raw markdown and converts it into React components,
// applying syntax highlighting and internal link optimization.
const renderMarkdownToReact = markdown => {
  try {
    return unified()
      .use(remarkParse) // Parse markdown into an abstract syntax tree
      .use(optimizeInternalLinks) // Apply our plugin to detect internal links
      .use(remarkRehype) // Convert Markdown AST to Rehype AST
      .use(rehypePrism, { showLineNumbers: false }) // Add PrismJS syntax highlighting
      .use(rehypeReact, {
        jsx,
        jsxs,
        Fragment,
        components: {
          a: props => <CustomLink to={props.href} {...props} />,
        },
      })
      .processSync(markdown).result
  } catch (error) {
    console.error("Failed to render markdown:", error)
    return <div>Error rendering markdown</div>
  }
}

And our plugin to convert internal links:

const optimizeInternalLinks = () => tree => {
  visit(tree, "link", node => {
    const { url } = node
    if (url.startsWith("/") || url.startsWith(SITE_URL)) {
      const internalPath = url.replace(SITE_URL, "")
      node.url = internalPath
    }
  })
}

This function calls visit from unist-util-visit to allow us to walk the AST. Its interface matches what is expected by a Remark plugin.

PHEW. That is quite a pipeline. This is the type of problem I probably would have given up on pre-ChatGPT. With a tool that points me to exactly what libraries to use and gets me close on the initial syntax, I was able to take it the rest of the way.

Now you can click around this blog and, when I reference a past post, it should load nearly instantaneously and preserve your dark mode settings. We did it!

What’s next?

I’m pleased with how the blog has turned out! I’d like to eventually add support for tags, a table of contents for each post, and perhaps a series page to link all the Memphis posts together. I could add comments using Disqus but, then again, would you invite a troll into your living room?

I’d love to hear from you! Please reach out if you find any bugs. I’m also curious what static site blog features have wow-ed you in the past, now that I am the proud owner of my own.

I’m gonna go write some Rust now.

Typed integers in Rust for safer Python bytecode compilation

Mon, 13 Jan 2025 00:00:00 GMT

Shortly after I shared my previous post, a helpful redditor pointed out that the typed integers I alluded to are known as the newtype pattern in Rust, a design that allows you to define types purely for compile-time safety and with no runtime hit.

And here I thought I invented it! Alas. 😂

I wanted to share a few details about what challenge I encountered and how I solved it.

The problem

Like I mentioned in the previous post, bytecode is a lower-level representation of your code created for efficient evaluation. One of the optimizations is that variable names are converted into indices during bytecode compilation, which supports faster lookup at runtime when the VM pumps through your bytecode.
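As a rough illustration of the name-to-index step, here is a tiny sketch. The NameTable type and its methods are hypothetical, not taken from Memphis or CPython; a real compiler would likely use a map for the lookup rather than a linear scan.

```rust
// Hypothetical table a compiler could build while walking the source.
#[derive(Default)]
struct NameTable {
    names: Vec<String>,
}

impl NameTable {
    // Return the existing index for `name`, or append it and return the new index.
    fn index_of(&mut self, name: &str) -> usize {
        if let Some(index) = self.names.iter().position(|known| known == name) {
            return index;
        }
        self.names.push(name.to_string());
        self.names.len() - 1
    }

    // What the VM would do at runtime: a plain indexed lookup, no string hashing.
    fn name_at(&self, index: usize) -> Option<&str> {
        self.names.get(index).map(String::as_str)
    }
}
```

The compiler emits the index in the bytecode, and the VM only ever sees the number.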

At one point while implementing this myself, I was debugging a sequence that felt like this. (I’m using the wishy-washy “felt like” here because I have no idea what the actual bytecode looked like.)

LOAD_GLOBAL 0
LOAD_FAST 0
LOAD_CONST 0

Okay, so we’re loading the first global, the first local, and the first constant. Given the ongoing evolution of my VM design and implementation, the data structures I’m using to communicate the mappings of these indices to their symbol names and/or values between the compiler and VM have been in a steady flux.

I’d sit down to implement the execution of a given opcode in the VM, see that I was handed the value 0, and I’d think, “What do they want me to do with this?!” (“they” in this case being myself from 5 minutes prior hacking on the compiler stage.)
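To make the failure mode concrete, here is a minimal sketch. The `Vm` struct and its methods are hypothetical stand-ins rather than Memphis code; the only point is that a bare `usize` carries no hint about which table it belongs to.

```rust
/// Hypothetical, simplified VM state: every table is addressed by a bare `usize`.
pub struct Vm {
    pub locals: Vec<i64>,
    pub constants: Vec<i64>,
}

impl Vm {
    /// Read a local variable by index.
    pub fn load_fast(&self, index: usize) -> i64 {
        self.locals[index]
    }

    /// Read a constant by index.
    pub fn load_const(&self, index: usize) -> i64 {
        self.constants[index]
    }
}

/// Returns (what the buggy lookup read, what the intended lookup reads).
pub fn mixed_up_lookup() -> (i64, i64) {
    let vm = Vm {
        locals: vec![7],
        constants: vec![42],
    };
    // This index was meant for the constant pool...
    let constant_index: usize = 0;
    // ...but nothing stops us from using it against the locals table.
    let wrong = vm.load_fast(constant_index);
    let right = vm.load_const(constant_index);
    (wrong, right)
}
```

Both calls compile, and `mixed_up_lookup()` returns `(7, 42)`: the wrong table is read without so much as a warning.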

After interpreting a 0 wrong for the fourth or fifth time, I decided THERE HAS TO BE A BETTER WAY. I asked myself, “Is there a way to get the compiler to tell me when I’m using a 0 incorrectly?” In addition to compile-time checking, with rust-analyzer configured in my Neovim, I should get a type error as soon as I make the mistake.

Could I add types to my integers?!

The CPython bytecode docs even hint at these distinctions, but incorporating them into my own implementation wasn’t immediately clear until I experienced the pain firsthand. For my three opcodes above, the respective documentation is:

LOAD_GLOBAL(namei)
Loads the global named co_names[namei>>1] onto the stack.

LOAD_FAST(var_num)
Pushes a reference to the local co_varnames[var_num] onto the stack.

LOAD_CONST(consti)
Pushes co_consts[consti] onto the stack.

co_names, co_varnames, and co_consts are three fields on a CPython code object, which is an immutable piece of compiled bytecode. In my interpreter, I’m using names and varnames to show my originality.

(Note to self: I treat constants separately at the moment, but I should probably unify them as I solidify my understanding of code objects.)

(Another note to self: how curious that they are shifting namei to the right during LOAD_GLOBAL. I hope to one day know why.)
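For what it's worth, the CPython `dis` documentation for 3.11 and later does hint at the reason: the low bit of namei is a flag telling the VM to push an extra NULL onto the stack alongside the global, and the remaining bits hold the index into co_names. A minimal sketch of decoding such an operand, assuming that encoding (the function name is made up for illustration and belongs to neither Memphis nor CPython):

```rust
/// Split a LOAD_GLOBAL operand into (index into co_names, push-NULL flag).
/// Assumes the CPython 3.11+ encoding described in the `dis` documentation.
pub fn decode_load_global(namei: u32) -> (usize, bool) {
    let index = (namei >> 1) as usize;
    let push_null = namei & 1 == 1;
    (index, push_null)
}
```

Under this reading, an operand of 5 refers to co_names[2] with the flag set, and an operand of 4 refers to the same name with the flag clear.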

The solution

I added typed integers to keep myself sane. They offered an additional benefit of providing me an opportunity to gain more experience with generics in Rust!

Here is a subset of my Opcode implementation.

pub enum Opcode {
    /// Push the value found at the specified index in the constant pool onto the stack.
    LoadConst(ConstantIndex),
    /// Read the local variable indicated by the specified index and push the value onto the stack.
    LoadFast(LocalIndex),
    /// Read the global variable indicated by the specified index and push the value onto the stack.
    LoadGlobal(NonlocalIndex),
}

My hope here is these new types, ConstantIndex, LocalIndex, and NonlocalIndex, would semantically illustrate the behavior of each opcode while providing type safety.

Generics entered the scene next as I was hoping to implement this with minimal code duplication.

Here is the next layer of the onion.

pub type ConstantIndex = Index<ConstantMarker>;
pub type LocalIndex = Index<LocalMarker>;
pub type NonlocalIndex = Index<NonlocalMarker>;

We’re beginning to see some code reuse–awesome!

What are these marker types? They are truly that: a type which is literally just a marker with no other data. These empty types are enforced at compile-time and do nothing at runtime.

pub struct ConstantMarker;
pub struct LocalMarker;
pub struct NonlocalMarker;

In Rust, PhantomData<T> from std::marker allows you to include a type in a struct purely for compile-time checks without affecting runtime performance, which is exactly the type safety we’re seeking! I’m using usize for the value because my use-case was indices, which should never be negative.

use std::fmt::{Display, Error, Formatter};
use std::marker::PhantomData;
use std::ops::Deref;

/// An unsigned integer wrapper which provides type safety. This is particularly useful when
/// dealing with indices used across the bytecode compiler and the VM as common integer values such
/// as 0, 1, etc, can be interpreted many different ways.
#[derive(Copy, Clone, PartialEq, Hash, Eq)]
pub struct Index<T> {
    value: usize,
    _marker: PhantomData<T>,
}

impl<T> Index<T> {
    pub fn new(value: usize) -> Self {
        Self {
            value,
            _marker: PhantomData,
        }
    }
}

impl<T> Deref for Index<T> {
    type Target = usize;

    fn deref(&self) -> &Self::Target {
        &self.value
    }
}

impl<T> Display for Index<T> {
    fn fmt(&self, f: &mut Formatter) -> Result<(), Error> {
        write!(f, "{}", self.value)
    }
}

The last piece of magic is impl<T> Deref for Index<T>, which allows us to treat instances of our new type as integers when dereferenced.

As a result, a piece of code like this would pass with flying colors.

#[test]
fn test_dereference() {
    let index: LocalIndex = Index::new(4);
    assert_eq!(*index, 4)
}
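Here is a self-contained sketch of the payoff, under a couple of assumptions: `read_local`, `demo`, and `same_size_as_usize` are hypothetical helpers, and the marker types derive the same traits as `Index<T>`. That last part matters because `#[derive(...)]` on a generic struct adds the matching bound on `T`, so `Index<LocalMarker>` is only `Copy` if `LocalMarker` is too.

```rust
use std::marker::PhantomData;
use std::mem::size_of;

// Marker types carry no data; they only distinguish one kind of index from another.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct LocalMarker;
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct ConstantMarker;

#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct Index<T> {
    value: usize,
    _marker: PhantomData<T>,
}

impl<T> Index<T> {
    pub fn new(value: usize) -> Self {
        Self { value, _marker: PhantomData }
    }

    pub fn value(&self) -> usize {
        self.value
    }
}

pub type LocalIndex = Index<LocalMarker>;
pub type ConstantIndex = Index<ConstantMarker>;

/// Hypothetical VM helper: only a `LocalIndex` may address the locals table.
pub fn read_local(locals: &[i64], index: LocalIndex) -> Option<i64> {
    locals.get(index.value()).copied()
}

pub fn demo() -> Option<i64> {
    let locals: [i64; 3] = [10, 20, 30];
    let local: LocalIndex = Index::new(1);
    let _constant: ConstantIndex = Index::new(1);

    // Uncommenting the next line is a compile error (mismatched types):
    // read_local(&locals, _constant);

    read_local(&locals, local)
}

/// The phantom marker is zero-sized, so the wrapper is no bigger than a `usize`.
pub fn same_size_as_usize() -> bool {
    size_of::<LocalIndex>() == size_of::<usize>()
}
```

`demo()` returns `Some(20)`, and `same_size_as_usize()` returns `true`, which is the "no runtime hit" part of the newtype bargain.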

By this point, we are free to use these in the bytecode compiler! Most places in the code don’t require specifying the generic because it will be inferred by the function signature, like in this example.

fn get_or_set_local_index(&mut self, name: &str) -> LocalIndex {
    if let Some(index) = self.get_local_index(name) {
        index
    } else {
        let code = self.ensure_code_object_mut();
        let new_index = code.varnames.len();
        code.varnames.push(name.to_string());
        Index::new(new_index)
    }
}

We initialize a new index with Index::new(new_index), which the compiler knows to treat as an Index<LocalMarker>, which we have aliased to LocalIndex to allow us to be blissfully ignorant VM developers.

The end

This is a small but powerful example of how I’m falling head-over-heels for Rust’s type system. I love writing expressive code to implement core features from scratch in a way which manages complexity and makes people smile.

My mentor at my first big-boy job was a wizard at this and I credit him for showing me what is possible: spending more time thinking about what you’re trying to build rather than being bogged down by how you are trying to build it.

Have you used Rust’s type system in any fun and creative ways? Or done something similar in another language? I’d love to hear your thoughts in the comments. Be well!

How I added support for nested functions in Python bytecode

Mon, 30 Dec 2024 00:00:00 GMT

I wanted to share some pretty cool stuff I’ve been learning about Python bytecode with you, including how I added support for nested functions, but my guy at the printing press said I needed to keep it under 500 words.

It’s a holiday week, he shrugged. What do you expect me to do?

Excluding code snippets, I bargained.

Fine, he ceded.

Do you know why we use bytecode in the first place?

I just operate the printing press, I trust you though.

Fair enough. Let’s begin.

Why we use bytecode in the first place

Memphis, my Python interpreter written in Rust, has two execution engines. Neither can run all code but both can run some code.

My treewalk interpreter is what you would build if you didn’t know what you were doing. 🙋‍♂️ You tokenize the input Python code, generate an abstract syntax tree (AST), and then walk the tree and evaluate each node. Expressions return values and statements modify the symbol table, which is implemented as a series of scopes which respect Python scoping rules. Just remember the easy mnemonic LEGB: local, enclosing, global, builtin.

My bytecode VM is what you would build if you didn’t know what you were doing but wanted to act like you did. Also 🙋‍♂️. For this engine, the tokens and AST work the same, but rather than walking we take off sprinting. We compile the AST into an intermediate representation (IR) hereafter known as bytecode. We then create a stack-based virtual machine (VM), which conceptually acts like a CPU, executing bytecode instructions in sequence, but it’s implemented entirely in software.

(For a complete guide of both approaches without the ramblings, Crafting Interpreters is excellent.)

Why do we do this in the first place? Just remember the two Ps: portability and performance. Remember how in the early 2000s nobody would shut up about how Java bytecode was portable? All you need is a JVM and you can run a Java program compiled on any machine! Python chose not to go with this approach for both technical and marketing reasons, but in theory the same principles apply. (In practice, the compilation steps are different and I regret opening this can of worms.)

Performance is the biggie though. Rather than traversing an AST multiple times during the lifetime of a program, the compiled IR is a more efficient representation. We see improved performance from avoiding the overhead of repeatedly traversing an AST, and its flat structure often results in better branch prediction and cache locality at runtime.

(I don’t blame you for not thinking about caching if you don’t have a background in computer architecture—heck, I began my career in that industry and I think about caching far less than I think about how to avoid writing the same line of code twice. So just trust me on the performance piece. That’s my leadership style: blind trust.)

Hey buddy, that’s 500 words. We need to load up the frame and let ‘er rip.

Already?! You excluded code snippets?

There are no code snippets, my man.

Okay okay. Just 500 more. I promise.

Context matters for Python variables

I got kinda far before tabling my bytecode VM implementation about a year ago: I could define Python functions and classes and call those functions and instantiate those classes. I clamped down this behavior with some tests. But I knew my implementation was messy and that I’d need to revisit the fundamentals before adding more fun stuff. Now it’s Christmas week and I want to add fun stuff.

Consider this snippet for calling a function, keeping an eye on the TODO.

fn compile_function_call(
    &mut self,
    name: &str,
    args: &ParsedArguments,
) -> Result<Bytecode, CompileError> {
    let mut opcodes = vec![];

    // We push the args onto the stack in reverse call order so that we will pop
    // them off in call order.
    for arg in args.args.iter().rev() {
        opcodes.extend(self.compile_expr(arg)?);
    }

    let (_, index) = self.get_local_index(name);

    // TODO how does this know if it is a global or local index? this may not be the right
    // approach for calling a function
    opcodes.push(Opcode::Call(index));

    Ok(opcodes)
}

Are you done considering? We load the function arguments onto the stack and “call the function”. In bytecode, all names are converted into indices (because index access is faster during the VM runtime), but we don’t really have a way to know whether we are dealing with a local index or a global index here.

Now consider the improved version.

fn compile_function_call(
    &mut self,
    name: &str,
    args: &ParsedArguments,
) -> Result<Bytecode, CompileError> {
    let mut opcodes = vec![self.compile_load(name)];

    // We push the args onto the stack in reverse call order so that we will pop
    // them off in call order.
    for arg in args.args.iter().rev() {
        opcodes.extend(self.compile_expr(arg)?);
    }

    // Count the arguments themselves: one argument may compile to several opcodes.
    let argc = args.args.len();
    opcodes.push(Opcode::Call(argc));

    Ok(opcodes)
}

Thank you for considering that code.

We now support nested function calls! What changed?

The Call opcode now takes a number of positional arguments, rather than an index to the function. This instructs the VM how many arguments to pop off the stack before calling the function.
After popping the arguments off the stack, the function itself will be left on the stack and compile_load has already handled local versus global scope for us.
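To see why this works at runtime, here is a minimal sketch of how a VM could execute Call(argc). Everything in it (the `Value` and `Opcode` enums, the built-in `subtract` function, and the `run` loop) is a hypothetical stand-in, not the Memphis VM.

```rust
#[derive(Clone)]
pub enum Value {
    Int(i64),
    /// A plain function pointer stands in for a compiled function object.
    Function(fn(&[i64]) -> i64),
}

pub enum Opcode {
    /// Push a value (stands in for LoadFast, LoadGlobal, and LoadConst).
    Push(Value),
    /// Call the function sitting beneath `argc` arguments on the stack.
    Call(usize),
}

fn subtract(args: &[i64]) -> i64 {
    args[0] - args[1]
}

pub fn run(program: &[Opcode]) -> Option<i64> {
    let mut stack: Vec<Value> = Vec::new();
    for opcode in program {
        match opcode {
            Opcode::Push(value) => stack.push(value.clone()),
            Opcode::Call(argc) => {
                // Arguments were pushed in reverse call order, so popping yields call order.
                let mut args = Vec::with_capacity(*argc);
                for _ in 0..*argc {
                    match stack.pop()? {
                        Value::Int(i) => args.push(i),
                        Value::Function(_) => return None,
                    }
                }
                // With the arguments gone, the function itself is on top of the stack.
                match stack.pop()? {
                    Value::Function(f) => stack.push(Value::Int(f(&args))),
                    Value::Int(_) => return None,
                }
            }
        }
    }
    match stack.pop()? {
        Value::Int(i) => Some(i),
        Value::Function(_) => None,
    }
}

/// Bytecode for `subtract(5, 3)`: load the function, push args in reverse, call with argc = 2.
pub fn demo() -> Option<i64> {
    let program = [
        Opcode::Push(Value::Function(subtract)),
        Opcode::Push(Value::Int(3)),
        Opcode::Push(Value::Int(5)),
        Opcode::Call(2),
    ];
    run(&program)
}
```

`demo()` returns `Some(2)`. The Call opcode never needs to know whether the function came from a local or a global, because whichever load ran first already put it on the stack.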

LOAD_GLOBAL versus LOAD_FAST

Let’s take a look at what compile_load is doing.

fn compile_load(&mut self, name: &str) -> Opcode {
    match self.ensure_context() {
        Context::Global => Opcode::LoadGlobal(self.get_or_set_nonlocal_index(name)),
        Context::Local => {
            // Check locals first
            if let Some(index) = self.get_local_index(name) {
                return Opcode::LoadFast(index);
            }

            // If not found locally, fall back to globals
            Opcode::LoadGlobal(self.get_or_set_nonlocal_index(name))
        }
    }
}

There are several key principles in action here:

We match based on the current context. Adhering to Python semantics, we can consider Context::Global to be at the top level of any module (not just your script’s entrypoint), and Context::Local is inside any block (i.e. function definition or class definition).
We now differentiate between a local index and a non-local index. (Because I was going crazy trying to decipher what the index 0 referred to in different places, I introduced typed-integers. LocalIndex and NonlocalIndex provide type-safety for otherwise untyped unsigned integers. I may write about this in the future!)
We can tell at bytecode-compilation time whether a local variable exists with a given name, and if it does not, at runtime we will search for a global variable. This speaks to the dynamism built into Python: as long as a variable is present in the global scope of that module by the time a function executes, its value can be resolved at runtime. However, this dynamic resolution comes with a performance hit. While local variable lookups are optimized to use stack indices, global lookups require searching the global namespace dictionary, which is slower. This dictionary is a mapping of names to objects, which themselves may live on the heap. Who knew that the saying “Think globally, act locally.” was actually referring to Python scopes?
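The last point can be sketched as follows. This is a simplified model with hypothetical helper functions and integer-only values: locals are read from a slot by index, while a global index is first turned back into a name and then looked up in a dictionary that may have changed since compilation.

```rust
use std::collections::HashMap;

/// Local lookup: the compiler already resolved the name to a slot index.
pub fn load_fast(locals: &[i64], index: usize) -> Option<i64> {
    locals.get(index).copied()
}

/// Global lookup: index -> name -> dictionary entry, resolved at runtime.
pub fn load_global(
    names: &[String],
    globals: &HashMap<String, i64>,
    index: usize,
) -> Result<i64, String> {
    let name = &names[index];
    globals
        .get(name)
        .copied()
        .ok_or_else(|| format!("name '{}' is not defined", name))
}

pub fn demo() -> (Option<i64>, Result<i64, String>, Result<i64, String>) {
    let locals = vec![1];
    let names = vec![String::from("limit")];
    let mut globals: HashMap<String, i64> = HashMap::new();

    let local = load_fast(&locals, 0);
    // The global does not exist yet, so the lookup fails at runtime...
    let before = load_global(&names, &globals, 0);
    // ...but the very same bytecode index succeeds once the name is defined.
    globals.insert(String::from("limit"), 10);
    let after = load_global(&names, &globals, 0);

    (local, before, after)
}
```

The same index 0 fails before the global is defined and succeeds afterwards, which is the dynamism (and the extra hashing work) described above.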

What’s in a varname?

The last thing I’ll leave you with today is a peek into how these variable names are mapped. In the code snippet below, you’ll notice that local indices are found in code.varnames and nonlocal indices are found in code.names. Both live on a CodeObject, which contains the metadata for a block of Python bytecode, including its variable and name mappings.

fn get_or_set_local_index(&mut self, name: &str) -> LocalIndex {
    if let Some(index) = self.get_local_index(name) {
        index
    } else {
        let code = self.ensure_code_object_mut();
        let new_index = code.varnames.len();
        code.varnames.push(name.to_string());
        Index::new(new_index)
    }
}

fn get_local_index(&self, name: &str) -> Option<LocalIndex> {
    let code = self.ensure_code_object();
    find_index(&code.varnames, name).map(Index::new)
}

fn get_or_set_nonlocal_index(&mut self, name: &str) -> NonlocalIndex {
    let code = self.ensure_code_object_mut();
    if let Some(index) = find_index(&code.names, name) {
        Index::new(index)
    } else {
        let new_index = code.names.len();
        code.names.push(name.to_string());
        Index::new(new_index)
    }
}

The difference between varnames and names tormented me for weeks (CPython calls these co_varnames and co_names), but it’s actually fairly straightforward. varnames holds the variable names for all local variables in a given scope, and names does the same for all nonlocal variables.

Once we properly track this, everything else just works. At runtime, the VM sees a LOAD_GLOBAL or a LOAD_FAST and knows to look in the global namespace dictionary or the local stack, respectively.
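The snippets above call a find_index helper that isn't shown. A plausible sketch, assuming both tables are plain Vec<String> (the signature is inferred from the call sites, so treat it as a guess rather than the Memphis implementation):

```rust
/// Return the position of `name` in a name table, if present.
pub fn find_index(names: &[String], name: &str) -> Option<usize> {
    names.iter().position(|candidate| candidate == name)
}
```

This is a linear scan, which is usually fine here because the search happens once per name at compile time; the VM only ever sees the resulting index.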

Buddy! Mr Gutenberg is on the phone and says we can hold the presses no longer.

Okay! Fine! I get it! Let’s ship it. 🚢

What’s next for Memphis?

Shh! The printing press man doesn’t know I’m writing a conclusion, so I will be brief.

With variable scoping and function calls in a solid place, I’m gradually turning my attention to features like stack traces and async support. If you’ve enjoyed this dive into bytecode or have questions about building your own interpreter, I’d love to hear from you—drop a comment!

Improving memory efficiency in a working interpreter

Mon, 16 Dec 2024 00:00:00 GMT

Lifetimes are a fascinating feature of Rust and the human experience. This is a technical blog, so let’s focus on the former. I was admittedly a slow adopter for leveraging lifetimes to safely borrow data in Rust. In the treewalk implementation of Memphis, my Python interpreter written in Rust, I hardly leverage lifetimes (by cloning incessantly) and I repeatedly elude the borrow checker (by using interior mutability, also incessantly) whenever possible.

My fellow Rustaceans, I am here today to tell you this ends now. Read my lips……no more shortcuts.

Okay okay, let’s be real. What is a shortcut versus what’s the right way is a matter of priorities and perspective. We’ve all made mistakes, and I’m here to take accountability for mine.

I began writing an interpreter six weeks after I first installed rustc because I have no chill. With that haranguing and posturing out of the way, let’s begin today’s lecture on how we can use lifetimes as our lifeline to improve my bloated interpreter codebase.

Identifying and avoiding cloned data

A Rust lifetime is a mechanism which provides a compile-time guarantee that any references do not outlive the objects to which they refer. They allow us to avoid the “dangling pointer” problem from C and C++.

This is assuming you leverage them at all! Cloning is a convenient workaround when you want to avoid the complexities associated with managing lifetimes, though the downside is increased memory usage and a small time cost each time data is copied.

Using lifetimes also forces you to think more idiomatically about owners and borrowing in Rust, which I was eager to do.
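A minimal illustration of that guarantee, using hypothetical helper names: the returned slice borrows from its input, so the compiler refuses to let the owner go away while the slice is still in use.

```rust
/// Returns a slice borrowed from `text`; lifetime elision ties the output to the input.
pub fn first_word(text: &str) -> &str {
    text.split_whitespace().next().unwrap_or("")
}

pub fn demo() -> String {
    let source = String::from("def add(x, y):");
    let word = first_word(&source);

    // Uncommenting the next line is a compile error: `source` cannot be moved
    // (and dropped) while `word` still borrows from it.
    // drop(source);

    // Cloning is the escape hatch: the new String owns its own copy of the data.
    word.to_string()
}
```

`demo()` returns `"def"`. The final `to_string()` is exactly the kind of copy that borrowing lets you skip when the owner is guaranteed to outlive the reference.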

I chose my first candidate as the tokens from a Python input file. My original implementation, which relied heavily on ChatGPT guidance while I was sitting on Amtrak, used this flow:

we pass our Python text to a Builder
the Builder creates a Lexer, which tokenizes the input stream
the Builder then creates a Parser, which clones the token stream to hold its own copy
the Builder is used to create an Interpreter, which repeatedly asks the Parser for its next parsed statement and evaluates it until we reach the end of the token stream

The convenient aspect of cloning the token stream is that the Lexer was free to be dropped after step 3. By updating my architecture to have the Lexer own the tokens and the Parser just borrow them, the Lexer would now be required to stay alive much longer. Rust lifetimes would guarantee this for us: as long as the Parser existed holding a reference to a borrowed token, the compiler would guarantee that the Lexer which owns those tokens still existed, ensuring a valid reference.
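A stripped-down sketch of that ownership arrangement. The toy `Token`, `Lexer`, and `Parser` here are simplified stand-ins with a whitespace tokenizer, not the real Memphis types.

```rust
#[derive(Debug, Clone, PartialEq)]
pub enum Token {
    Name(String),
    Eof,
}

// A single shared end-of-stream token that any parser can reference.
static EOF: Token = Token::Eof;

/// Owns the token stream.
pub struct Lexer {
    tokens: Vec<Token>,
}

impl Lexer {
    /// Toy tokenizer: every whitespace-separated word becomes a `Name` token.
    pub fn new(text: &str) -> Self {
        let tokens = text
            .split_whitespace()
            .map(|word| Token::Name(word.to_string()))
            .collect();
        Self { tokens }
    }

    pub fn tokens(&self) -> &[Token] {
        &self.tokens
    }
}

/// Borrows the tokens; `'a` ties the parser to the lexer that owns them.
pub struct Parser<'a> {
    tokens: &'a [Token],
    position: usize,
}

impl<'a> Parser<'a> {
    pub fn new(tokens: &'a [Token]) -> Self {
        Self { tokens, position: 0 }
    }

    /// Hand out the next token by reference, without cloning it.
    pub fn next_token(&mut self) -> &'a Token {
        let token = self.tokens.get(self.position).unwrap_or(&EOF);
        self.position += 1;
        token
    }
}

pub fn demo() -> Vec<Token> {
    let lexer = Lexer::new("x y");
    let mut parser = Parser::new(lexer.tokens());

    // Uncommenting the next line is a compile error: the lexer cannot be
    // dropped while the parser still borrows its tokens.
    // drop(lexer);

    // The clones below exist only so the demo can return owned values.
    vec![
        parser.next_token().clone(),
        parser.next_token().clone(),
        parser.next_token().clone(),
    ]
}
```

`demo()` yields `Name("x")`, `Name("y")`, and then `Eof` once the borrowed slice is exhausted.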

Like all code always, this ended up being a bigger change than I expected. Let’s see why!

The new parser

Before updating the Parser to borrow the tokens from the Lexer, it looked like this. The two fields of interest for today’s discussion are tokens and current_token. We have no idea how large the Vec<Token> is, but it is distinctly ours (i.e. we are not borrowing it).

pub struct Parser {
    state: Container<State>,
    tokens: Vec<Token>,
    current_token: Token,
    position: usize,
    line_number: usize,
    delimiter_depth: usize,
}

impl Parser {
    pub fn new(tokens: Vec<Token>, state: Container<State>) -> Self {
        let current_token = tokens.first().cloned().unwrap_or(Token::Eof);
        Parser {
            state,
            tokens,
            current_token,
            position: 0,
            line_number: 1,
            delimiter_depth: 0,
        }
    }
}

After borrowing the tokens from the Lexer, it looks fairly similar, but now we see a LIFETIME! By connecting tokens to the lifetime 'a, the Rust compiler will not allow the owner of the tokens (which is our Lexer) and the tokens themselves to be dropped while our Parser still references them. This feels safe and fancy!

static EOF: Token = Token::Eof;

/// A recursive-descent parser which attempts to encode the full Python grammar.
pub struct Parser<'a> {
    state: Container<State>,
    tokens: &'a [Token],
    current_token: &'a Token,
    position: usize,
    line_number: usize,
    delimiter_depth: usize,
}

impl<'a> Parser<'a> {
    pub fn new(tokens: &'a [Token], state: Container<State>) -> Self {
        let current_token = tokens.first().unwrap_or(&EOF);
        Parser {
            state,
            tokens,
            current_token,
            position: 0,
            line_number: 1,
            delimiter_depth: 0,
        }
    }
}

Another small difference you may notice is this line:

static EOF: Token = Token::Eof;

This is a small optimization that I began considering once my Parser was moving in the direction of “memory-efficient.” Rather than instantiating a new Token::Eof each time the Parser needs to check if it is at the end of the text stream, the new model allowed me to instantiate only a single token and reference &EOF repeatedly.

Again, this is a small optimization, but it speaks to the larger mindset of each piece of data existing only once in memory and every consumer just referencing it when needed, which Rust both encourages and snugly holds your hand through.

Speaking of optimization, I really should have benchmarked the memory usage before and after. Since I did not, I have nothing more to say on the matter.

As I alluded to earlier, tying the lifetime of my Lexer and Parser together had a large impact on my Builder pattern. Let’s see what that looks like!

The new Builder: MemphisContext

In the flow I described above, remember how I mentioned that the Lexer could be dropped as soon as the Parser created its own copy of the tokens? This had unintentionally influenced the design of my Builder, which was intended to be the component which supports orchestrating Lexer, Parser, and Interpreter interactions, whether you begin with a Python text stream or a path to a Python file.

As you can see below, there are a few other non-ideal aspects to this design:

needing to call a dangerous downcast method to get the Interpreter.
why did I think it was okay to return a Parser to every unit test just to then pass it right back into interpreter.run(&mut parser)?!

fn downcast<T: InterpreterEntrypoint + 'static>(input: T) -> Interpreter {
    let any_ref: &dyn Any = &input as &dyn Any;
    any_ref.downcast_ref::<Interpreter>().unwrap().clone()
}

fn init(text: &str) -> (Parser, Interpreter) {
    let (parser, interpreter) = Builder::new().text(text).build();

    (parser, downcast(interpreter))
}


#[test]
fn function_definition() {
    let input = r#"
def add(x, y):
    return x + y

a = add(2, 3)
"#;
    let (mut parser, mut interpreter) = init(input);

    match interpreter.run(&mut parser) {
        Err(e) => panic!("Interpreter error: {:?}", e),
        Ok(_) => {
            assert_eq!(
                interpreter.state.read("a"),
                Some(ExprResult::Integer(5.store()))
            );
        }
    }
}

Below is the new MemphisContext interface. This mechanism manages the Lexer lifetime internally (to keep our references alive long enough to keep our Parser happy!) and only exposes what is needed to run this test.

fn init(text: &str) -> MemphisContext {
    MemphisContext::from_text(text)
}

#[test]
fn function_definition() {
    let input = r#"
def add(x, y):
    return x + y

a = add(2, 3)
"#;
    let mut context = init(input);

    match context.run_and_return_interpreter() {
        Err(e) => panic!("Interpreter error: {:?}", e),
        Ok(interpreter) => {
            assert_eq!(
                interpreter.state.read("a"),
                Some(ExprResult::Integer(5.store()))
            );
        }
    }
}
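How a context might keep the Lexer alive internally can be sketched like this. The types below are hypothetical, heavily simplified stand-ins: the "parse" step just counts tokens, and the real MemphisContext surely does more.

```rust
/// Owns the tokens (here, plain strings split on whitespace).
pub struct Lexer {
    tokens: Vec<String>,
}

/// Borrows the tokens from a lexer.
pub struct Parser<'a> {
    tokens: &'a [String],
}

impl<'a> Parser<'a> {
    /// Toy "parse": report how many tokens were lent to us.
    pub fn parse(&self) -> usize {
        self.tokens.len()
    }
}

/// Owns the lexer so that any parser borrowing from it stays valid.
pub struct Context {
    lexer: Lexer,
}

impl Context {
    pub fn from_text(text: &str) -> Self {
        let tokens = text.split_whitespace().map(str::to_string).collect();
        Self { lexer: Lexer { tokens } }
    }

    /// The parser lives only inside this call, so callers never handle its lifetime.
    pub fn run(&self) -> usize {
        let parser = Parser { tokens: &self.lexer.tokens };
        parser.parse()
    }
}
```

With this shape, `Context::from_text("a = add(2, 3)").run()` returns 4, and the caller never names a lifetime. Storing a `Parser<'a>` next to the `Lexer` it borrows from in the same struct would make that struct self-referential, which safe Rust does not allow directly, so this sketch builds the parser on demand instead.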

context.run_and_return_interpreter() is still a bit clunky and speaks to another design problem I may tackle down the road: when you run the interpreter, do you want to return only the final return value or something which lets you access arbitrary values from the symbol table? This method opts for the latter approach. I actually think there’s a case to do both, and will keep tweaking my API to allow for this as we go.

Incidentally, this change improved my ability to evaluate an arbitrary piece of Python code. If you’ll recall from my WebAssembly saga, I had to rely on my crosscheck TreewalkAdapter to do that at the time. Now, our Wasm interface is much cleaner.

#[cfg(feature = "wasm")]
mod wasm {
    use console_error_panic_hook::set_once;
    use wasm_bindgen::prelude::wasm_bindgen;

    use super::init::MemphisContext;

    #[wasm_bindgen]
    pub fn evaluate(code: String) -> String {
        // Set the panic hook for better error messages in the browser console
        set_once();

        let mut context = MemphisContext::from_text(&code);
        let result = context
            .evaluate_oneshot()
            .expect("Failed to evaluate expression.");
        format!("{}", result)
    }
}

The interface context.evaluate_oneshot() returns the expression result rather than a full symbol table. I wonder if there’s a better way to ensure any of the “oneshot” methods can only operate on a context once, ensuring that no consumers use them in a stateful context. I’ll keep simmering on that!
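One possible answer, offered as a sketch rather than a recommendation for Memphis: have the oneshot methods take self by value, so the context is moved into the call and the compiler rejects any later use. The `Context` type and its toy evaluator below are hypothetical.

```rust
/// Hypothetical context; the real `MemphisContext` holds much more.
pub struct Context {
    source: String,
}

impl Context {
    pub fn from_text(text: &str) -> Self {
        Self { source: text.to_string() }
    }

    /// Taking `self` by value moves the context into the call, so it cannot be reused.
    pub fn evaluate_oneshot(self) -> Result<i64, String> {
        // Toy evaluator: only understands a single integer literal.
        self.source.trim().parse::<i64>().map_err(|e| e.to_string())
    }
}

pub fn demo() -> Result<i64, String> {
    let context = Context::from_text(" 42 ");
    let result = context.evaluate_oneshot();

    // Uncommenting the next line is a compile error (use of moved value: `context`):
    // context.evaluate_oneshot();

    result
}
```

`demo()` returns `Ok(42)`. The trade-off is that a consumed context cannot be inspected afterwards, so anything the caller needs has to come back in the return value.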

Was this worth it?

Memphis is first-and-foremost a learning exercise, so this was absolutely worth it!

In addition to sharing the tokens between the Lexer and the Parser, I created an interface to evaluate Python code with significantly less boilerplate. While sharing data introduced additional complexity, these changes bring clear benefits: reduced memory usage, improved safety guarantees through stricter lifetime management, and a streamlined API that’s easier to maintain and extend.

I’m choosing to believe this was the right approach, mostly to maintain my self-esteem. Ultimately, I aim to write code that clearly reflects the principles of software and computer engineering. We can now open the Memphis source, point to the single owner of the tokens, and sleep soundly at night!

An interpreter inside an interpreter

Mon, 25 Nov 2024 00:00:00 GMT

A few months into development, I decided my north star for Memphis would be to run a Flask server entirely within my interpreter. I had no idea how much work this would entail, only that it sounded cool and would probably teach me a lot along the way. If I were making this goal today, I might pick FastAPI or nothing at all because that was silly of me.

Python stdlib

A big decision I encountered was how to deal with the Python standard lib. As you are likely familiar, the standard lib of a language is not technically part of the language definition or runtime. It is included with releases in order to make the language and runtime more useful. Imagine Python without threading or async support. You would still be able to evaluate expressions and instantiate classes, but most production-ready programs need some sort of concurrency support.

One option would be to rewrite the entire standard lib myself. I’m building an interpreter, aren’t I? I believe this is the approach taken by RustPython, which is an admirable path. I figured I had enough on my plate getting the runtime to work, was looking for any and all corners to cut, and decided against this.

The Python standard lib consists of two main parts: the parts implemented in Python and the parts implemented in C. Conveniently enough, I had my own Python interpreter. Could I just interpret the Python source file from the host machine to satisfy the former? Yes, I could. I’d need to support every syntax and feature they used, but after that, it would Just Work.

The C part is where it gets interesting. Way back yonder in 2023, I made a decision to embed a Python interpreter inside my Python interpreter without fully understanding what that meant. Now it was time to wrap my head around this and decide if I wanted to stay with this approach or choose another path.

The interop shop for Rust and Python is Pyo3. As the only game in town, Pyo3 uses the Foreign Function Interface (FFI) to allow your Rust code to make calls into the CPython binary. This works by agreeing on the Application Binary Interface (ABI), a concept I used during my career at AMD. Core software ftw!

Importing modules

My initial use-case was to run import sys and have it give me an object on which I could perform a member access operation. I’m getting into interpreter-speak here, but this is the type of REPL session I’m talking about.

Python 3.12.5 (main, Aug  6 2024, 19:08:49) [Clang 15.0.0 (clang-1500.3.9.4)]
Type "help", "copyright", "credits" or "license" for more information.
>>> import sys
>>> sys
<module 'sys' (built-in)>
>>> type(sys.modules)
<class 'dict'>

Getting this functionality using Pyo3 was straightforward.

pub struct CPythonModule(PyObject);

impl CPythonModule {
    pub fn new(name: &str) -> Self {
        pyo3::prepare_freethreaded_python();
        let pymodule = Python::with_gil(|py|
            PyModule::import(py, name).expect("Failed to import module").into()
        );

        Self(pymodule)
    }
}

And we can use this to drive a similar REPL session in Memphis, assuming you remember the cocktail of feature flags to get this to run.

memphis 0.1.0 REPL (Type 'exit()' to quit)
>>> import sys
>>> sys
<module 'sys' (built-in)>
>>> type(sys.modules)
<class 'dict' (built-in)>

If you’re asking yourself, couldn’t you just use this approach to import the entire standard lib (including the parts written in Python and C) and make your entire life, liberty, and the pursuit of happiness, easier, the answer is yes. That would be a valid approach! However, that would make my interpreter more of a shell around CPython than I would like. This is a learning exercise so I’m all for arbitrary decisions. For the purists out there who say loading any piece of CPython inside Memphis makes Memphis not a real interpreter, I would just say: please show me your interpreter.

I conducted a quick test with htop by running import sys inside a REPL session using both Memphis and CPython. On Memphis, because this loads the CPython libraries into memory, it increased the RAM usage (Resident Set Size in htop) by about 5MB. For comparison, the Memphis REPL after loading the sys module uses about 9MB of RAM, while the Python REPL before and after loading the sys module uses about the same. I’m sure this isn’t an apples-to-apples comparison, but it at least told me that Memphis wasn’t gonna slowly choke my computer to death.

Converting objects and getting existential

The next complexity with this setup involves converting my Memphis object representation into CPython representations and vice versa. This is a work-in-progress and my primary directive was, initially, “do not fail” and, more recently, “show warnings when you do a lossy conversion.”

Here is my conversion from a PyObject, which is the object representation on the Pyo3 side, into an ExprResult, my Memphis representation.

pub mod utils {
    pub fn from_pyobject(py: Python, py_obj: &PyAny) -> ExprResult {
        if let Ok(value) = py_obj.extract::<i64>() {
            ExprResult::Integer(Container::new(value))
        } else if let Ok(value) = py_obj.extract::<f64>() {
            ExprResult::FloatingPoint(value)
        } else if let Ok(value) = py_obj.extract::<&str>() {
            ExprResult::String(Str::new(value.to_string()))
        } else if let Ok(py_tuple) = py_obj.extract::<&PyTuple>() {
            let elements = py_tuple
                .iter()
                .map(|item| from_pyobject(py, item))
                .collect();
            ExprResult::Tuple(Container::new(Tuple::new(elements)))
        } else if let Ok(py_module) = py_obj.extract::<&PyModule>() {
            let mut module = Module::default();

            // Get the module's __dict__ to iterate over all attributes
            for (key, value) in py_module.dict() {
                let key_str: String =
                  key.extract().expect("Key is not a string");
                let expr_value = from_pyobject(py, value);
                module.insert(&key_str, expr_value);
            }

            ExprResult::Module(Container::new(module))
        } else if let Ok(py_set) = py_obj.extract::<&PySet>() {
            let elements = py_set
                .iter()
                .map(|item| from_pyobject(py, item))
                .collect();
            ExprResult::Set(Container::new(Set::new(elements)))
        } else if let Ok(py_list) = py_obj.extract::<&PyList>() {
            let elements = py_list
                .iter()
                .map(|item| from_pyobject(py, item))
                .collect();
            ExprResult::List(Container::new(List::new(elements)))
        } else {
            // TODO think of a way to detect whether this is an object we can
            // convert or not
            // log(LogLevel::Warn, || {
            //     "Potentially ambiguous CPythonObject instance.".to_string()
            // });
            ExprResult::CPythonObject(CPythonObject::new(py_obj.into_py(py)))
        }
    }
}

And here is the reverse conversion. Note that for both of these we must pass in a Python token (the py argument), which controls our access to the CPython GIL (global interpreter lock).

impl ToPyObject for ExprResult {
    fn to_object(&self, py: Python) -> PyObject {
        match self {
            ExprResult::None => py.None(),
            ExprResult::Boolean(b) => b.to_object(py),
            ExprResult::Integer(i) => i.borrow().to_object(py),
            ExprResult::String(s) => s.as_str().to_object(py),
            ExprResult::List(l) => {
                let list = PyList::empty(py);
                for item in l.clone().into_iter() {
                    list.append(item).expect("Failed to append to PyList");
                }
                list.to_object(py)
            }
            ExprResult::Function(_) => {
                // TODO our PyCFunction implementation is a no-op, we need to find a way to pass
                // the interpreter into here.
                let callback = |_args: &PyTuple, _kwargs: Option<&PyDict>| -> PyResult<bool> {
                    log(LogLevel::Warn, || {
                        "Potentially lossy PyCFunction invocation.".to_string()
                    });
                    Ok(true)
                };
                // TODO use real function name
                let py_cfunc = PyCFunction::new_closure(
                    py,
                    Some("memphis_func"),
                    None,
                    callback
                ).unwrap();
                py_cfunc.to_object(py)
            }
            ExprResult::Class(_) => {
                // TODO same here, our PyClass implementation does not bring real fields
                Py::new(py, TestClass {}).unwrap().to_object(py)
            }
            ExprResult::Module(module) => {
                let py_module = PyModule::new(py, &module.borrow().name()).unwrap();

                // Flatten all key-value pairs from scope into the module
                for (key, value) in module.borrow().dict() {
                    py_module.add(key, value.to_object(py)).unwrap();
                }

                py_module.to_object(py)
            }
            ExprResult::CPythonModule(module) => module.borrow().0.to_object(py),
            ExprResult::CPythonObject(object) => object.0.to_object(py),
            _ =>
                unimplemented!(
                    "Attempting to convert {} to a PyObject, but {} conversion is not implemented!",
                    self,
                    self.get_type()
                ),
        }
    }
}
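To make the "show warnings when you do a lossy conversion" directive concrete, here is a minimal sketch using stand-in types. `Value`, `Foreign`, `Converted`, and `to_foreign` are hypothetical names for illustration, not the Memphis or PyO3 types. The only idea it demonstrates is that a conversion can hand back a warning alongside the converted value instead of failing.

```rust
/// Stand-in for an interpreter-side value (hypothetical, not Memphis's `ExprResult`).
#[derive(Debug, Clone, PartialEq)]
enum Value {
    Integer(i64),
    Text(String),
    Function(String),
}

/// Stand-in for the representation on the other side of the boundary.
#[derive(Debug, Clone, PartialEq)]
enum Foreign {
    Int(i64),
    Str(String),
    /// Placeholder for something we cannot faithfully represent yet.
    Opaque,
}

/// A converted value plus an optional warning describing what was lost.
#[derive(Debug, PartialEq)]
struct Converted {
    value: Foreign,
    warning: Option<String>,
}

/// Never fails: anything we cannot convert becomes `Opaque` with a warning attached.
fn to_foreign(value: &Value) -> Converted {
    match value {
        Value::Integer(i) => Converted { value: Foreign::Int(*i), warning: None },
        Value::Text(s) => Converted { value: Foreign::Str(s.clone()), warning: None },
        Value::Function(name) => Converted {
            value: Foreign::Opaque,
            warning: Some(format!(
                "lossy conversion: function `{}` became a placeholder",
                name
            )),
        },
    }
}
```

A caller could log the warning when it is present and carry on with the value either way, which matches the "do not fail" directive.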

This is a rich area that I’d like to explore further. Here are some of the directions I’ve considered:

1. Convert each time an object crosses the FFI interface. (And yes, I realize that acronym expands to foreign function interface interface.) That’s roughly what I’m already doing, I would just need to own it and not feel like an imposter. This could be simple but inefficient.
2. Keep a registry so that each object exists at most once on each side. This would be more efficient than (1), but it’d require a stable value which you could use to look up and link up these objects.
3. Aim for a single representation on the Rust side and use PyO3 to proxy and lazily convert fields as needed. I believe this would still leverage the functionality of (1), but in a more efficient manner.
4. Make the memory layout of a Memphis object match that of a PyObject. Much like #[repr(C)] already does in Rust, this would play the role an ABI plays for a function call. I’m not even sure if this one is possible given the difference in what each side needs to do its evaluation, but this intrigues me.
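The registry idea could be sketched roughly like this. Everything here is hypothetical: `ObjectId`, `Registry`, and the strings standing in for the two object representations are assumptions for illustration, and the "stable value" is modeled as a plain integer id, which sidesteps the hard part of actually finding one.

```rust
use std::collections::HashMap;

/// A stable identifier shared by both sides (assumed to exist; choosing it is the hard part).
#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
struct ObjectId(u64);

/// Hypothetical registry that converts an object at most once per id.
struct Registry {
    /// Cached foreign-side representations, keyed by stable id.
    converted: HashMap<ObjectId, String>,
    /// Counts how many real conversions ran, to show the cache is doing its job.
    conversions: usize,
}

impl Registry {
    fn new() -> Self {
        Self { converted: HashMap::new(), conversions: 0 }
    }

    /// Return the cached conversion for `id`, converting `native` only on first sight.
    fn get_or_convert(&mut self, id: ObjectId, native: &str) -> &str {
        if !self.converted.contains_key(&id) {
            self.conversions += 1;
            self.converted.insert(id, format!("foreign({})", native));
        }
        &self.converted[&id]
    }
}
```

Asking for the same id twice runs only one conversion. The sketch ignores invalidation entirely: if the native object mutates after being cached, the two sides would drift apart.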

I’m getting ahead of myself because I can barely load a C module right now, but there’s truly no end to where my curiosity could take me in this area.

The End

I continue to poke at this when I hit a new conversion failure while plodding along towards getting Flask to boot. This exercise is a good reminder that all objects (or classes, modules, etc) are a set of attributes that exist in a known format in memory. If we understand that format well enough, we should be able to do incredible things, regardless of whether it is on the Memphis or CPython side.

This philosophy drives my work with From Scratch Code as well. If you are tired of being unable to get a library to work in your code, I encourage you to step back and ask: what is the library actually doing? Do you need it, or could a simpler solution work? I believe in cultivating this curiosity about software, and I’d be happy to help you incorporate this mindset into your toolbox.

Building for WebAssembly

Mon, 18 Nov 2024 00:00:00 GMT

I’m currently exploring two interesting topics for Memphis, my Python interpreter in Rust: building for WebAssembly and embedding CPython. With no major milestones to report this week, I thought I’d share some in-progress thoughts. For me, Memphis has been a project for expanding my conceptual understanding through practical experiments. Hopefully, this post can do the same for you as we walk through some of the design decisions I'm exploring.

Python in the browser

Compiling Memphis to a WebAssembly target had been in the back of my mind for some time, and two Saturdays ago, I finally gave it a go. With a lukewarm cup of drip coffee on my coaster, I cracked my knuckles and began.

WebAssembly is a sandboxed execution environment inside modern web browsers which complements the traditional JavaScript environment. The Wasm environment is closer to native code and can be used for tasks which benefit from a more performant CPU context; think number crunching or silly busy loops. I was interested in it less from a performance perspective and more because it was possible at all. One of Rust’s selling points (out of literally bajillions) is it can target Wasm. How do, one might ask? This is possible because Rust uses LLVM as its compiler backend. The Rust compiler frontend produces LLVM Intermediate Representation (IR) code and LLVM can compile this to native code for dozens of targets.

That’s a pretty massive benefit and I was curious if it would Just Work for Memphis. I had given literally zero thought to running Python in the browser before, so this seemed like a perfect opportunity to test out the Wasm learning curve.

Setting Up Wasm-Pack and Building for WebAssembly

I fired up my AI assistant and asked for the launch sequence. It went beep boop beep boop. Below are the steps annotated with my learnings along the way.

# wasm-pack helps compile our Rust code to WebAssembly and bundle it with JavaScript bindings we
# can call from our HTML/JavaScript page.
cargo install wasm-pack

# wasm-pack also downloads the wasm32-unknown-unknown target via rustup for us.
# If for whatever reason it does not, you can use this: rustup target add wasm32-unknown-unknown
# We must specify a feature flag because our wasm_bindgen interface is behind the wasm feature flag.
wasm-pack build --target web --out-dir wasm_ui/pkg -- --features wasm

The build succeeded on my first try! However, because we haven’t marked any functions in our Rust binary as being available to call from WebAssembly, it doesn’t do much.

We can install the wasm-bindgen crate to do this, which I put behind a feature flag. I added this to my Cargo.toml.

[dependencies]
wasm-bindgen = { version = "0.2", optional = true }

[features]
wasm = ["wasm-bindgen"]

Here’s a small piece of code I added to my src/lib.rs file, behind the wasm feature flag. The greet function is decorated with #[wasm_bindgen] to make this symbol available in JavaScript.

#[cfg(feature = "wasm")]
mod wasm {
    use wasm_bindgen::prelude::wasm_bindgen;

    // Export a function to JavaScript
    #[wasm_bindgen]
    pub fn greet() -> String {
        "Hello from WebAssembly!".to_string()
    }
}

Creating a JavaScript Interface

I also asked my AI assistant for the smallest possible piece of JavaScript I could use to test my Wasm interface. When we call init(), the browser loads the .wasm file, performs a JIT compilation step to convert the portable WebAssembly binary into native code, and initializes memory for the WebAssembly runtime.

<!DOCTYPE html>
<html lang="en">
  <head>
    <meta charset="UTF-8" />
    <title>Wasm Test</title>
  </head>
  <body>
    <script type="module">
      import init, { greet } from "./pkg/memphis.js"

      async function run() {
        await init()
        console.log(greet())
      }

      run()
    </script>
  </body>
</html>

Like a miracle among miracles, it Just Worked. Granted, I wasn’t running any Python code in the browser, but interfacing with my binary was a HUGE step that younger-me-who-could-barely-install-java did not want to undervalue.

The next step was to give it a Python expression defined in JavaScript and have the Wasm binary crunch the numbers. As I mentioned in my REPL post, every entry point in a software project is an opportunity to improve my abstractions, and it would certainly be the case again here. As I thumbed through my Memphis repo, I realized Wow, I should really have a better interface to pass a string and evaluate it as Python. Like I said, I LOVE new entry points.

For the time being, I would use my crosscheck adapter. Crosscheck is my work-in-progress testing framework to validate that the treewalk interpreter and bytecode VM produce the same behavior for a given Python input. It’s named after the thing flight attendants do.

Here is my updated Rust code.

#[cfg(feature = "wasm")]
mod wasm {
    use wasm_bindgen::prelude::wasm_bindgen;

    use crosscheck::{InterpreterTest, TreewalkAdapter};

    // Export a function to JavaScript
    #[wasm_bindgen]
    pub fn greet() -> String {
        "Hello from WebAssembly!".to_string()
    }

    #[wasm_bindgen]
    pub fn evaluate(code: String) -> String {
        let result = TreewalkAdapter.execute(&code);
        format!("{}", result)
    }
}

Here is my updated JavaScript code, which invokes the new Rust evaluate function.

<!DOCTYPE html>
<html lang="en">
  <head>
    <meta charset="UTF-8" />
    <title>Wasm Test</title>
  </head>
  <body>
    <script type="module">
      import init, { greet, evaluate } from "./pkg/memphis.js"

      async function run() {
        await init()
        console.log(greet())
        const expr = "[ 2 * i for i in range(5) if i % 2 == 0 ]"
        console.log(expr, "=", evaluate(expr))
      }

      run()
    </script>
  </body>
</html>

Debugging WebAssembly Errors

Now when I ran it I got……… a console error. It crashed with an unimplemented error.

I poked around a bit and it was not clear what was causing this. You can click into the source but for a Wasm build that is just a block of assembly without references to the original Rust functions.

I did some AI chatting/Googling and found two helpful approaches. One is console_log for use in Wasm builds, which displays log statements from your Rust code in your browser console. This helped some, but what I was really looking for was a stack trace. Enter console_error_panic_hook. It gave me the Rust stack trace immediately, which was CLUTCH. If you are doing your own Wasm build, stop reading this now and add this crate. I don’t even mind if you never finish reading this post. Ferris would want you to use this crate 🦀. Here’s how I added it to my Wasm interface.

#[cfg(feature = "wasm")]
mod wasm {
    use console_error_panic_hook::set_once;
    use wasm_bindgen::prelude::wasm_bindgen;

    use crosscheck::{InterpreterTest, TreewalkAdapter};

    #[wasm_bindgen]
    pub fn evaluate(code: String) -> String {
        // Set the panic hook for better error messages in the browser console
        set_once();

        let result = TreewalkAdapter.execute(&code);
        format!("{}", result)
    }
}

My stack trace pointed me to my culprit: I was using std::env to request some OS resources, which are not allowed in a Wasm runtime (that’s the sandboxed part). I put these calls behind a feature flag (they are related to how I hack-ily determine the location of the Python standard lib on the host machine) and fired up my build again. After a few small failures related to properly displaying my return types….

IT WORKED. Here’s what I now see in my browser console.

wasm_ui/:13: Hello from WebAssembly!
wasm_ui/:15: [ 2 * i for i in range(5) if i % 2 == 0 ] = [0, 4, 8]

tldr I can run Python in the browser. (To their credit, RustPython does this too: https://rustpython.github.io/demo/. I haven’t looked deeply at their project but it seems comprehensive.) The Python list comprehension is defined in JavaScript in string form and the response list is evaluated by the Rust code compiled to Wasm and converted back into a string which can be displayed by JavaScript.
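For anyone hitting the same std::env wall, here is a minimal sketch of keeping host-only calls out of a Wasm build. Note the swap: I used a Cargo feature flag, while this sketch gates on `cfg(target_arch = "wasm32")` so it compiles as a single file. The function names and the `MEMPHIS_STDLIB` variable are hypothetical, made up for illustration.

```rust
use std::path::PathBuf;

/// Hypothetical helper: where to look for the Python standard library.
/// On native targets we may consult the environment (the variable name is an assumption).
#[cfg(not(target_arch = "wasm32"))]
fn stdlib_search_path() -> Option<PathBuf> {
    std::env::var_os("MEMPHIS_STDLIB").map(PathBuf::from)
}

/// In the Wasm sandbox there is no host environment to ask, so report nothing.
#[cfg(target_arch = "wasm32")]
fn stdlib_search_path() -> Option<PathBuf> {
    None
}

/// Callers look the same on every target; only the helper above differs.
fn describe_stdlib() -> String {
    match stdlib_search_path() {
        Some(path) => format!("stdlib at {}", path.display()),
        None => "no stdlib configured".to_string(),
    }
}
```

The compiler only ever sees one of the two `stdlib_search_path` definitions, so the sandbox-unfriendly call never reaches the Wasm binary.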

This setup only supports expressions at the moment. To evaluate statements (and later read back their results), I will need to keep state on the Rust side. I also dream of building a JavaScript REPL. That sounds like a problem for future-me (and a boring dream tbh).
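As a rough idea of what "keeping state on the Rust side" could look like, here is a toy sketch in plain Rust (no wasm_bindgen involved). It is not Memphis code: `GLOBALS`, `exec_assignment`, and `eval_name` are hypothetical, the "statement" parser only understands `name = <integer>`, and the "expression" evaluator only understands a bare variable name. The point is just that a thread-local survives between separate calls.

```rust
use std::cell::RefCell;
use std::collections::HashMap;

thread_local! {
    /// State that survives between calls (hypothetical stand-in for interpreter globals).
    static GLOBALS: RefCell<HashMap<String, i64>> = RefCell::new(HashMap::new());
}

/// Toy "statement": only understands `name = <integer>`.
fn exec_assignment(code: &str) -> Result<(), String> {
    let (name, value) = code.split_once('=').ok_or("expected `name = value`")?;
    let value: i64 = value
        .trim()
        .parse()
        .map_err(|e| format!("bad integer: {}", e))?;
    GLOBALS.with(|g| g.borrow_mut().insert(name.trim().to_string(), value));
    Ok(())
}

/// Toy "expression": only understands a bare variable name.
fn eval_name(code: &str) -> Result<i64, String> {
    GLOBALS.with(|g| {
        g.borrow()
            .get(code.trim())
            .copied()
            .ok_or_else(|| format!("NameError: name '{}' is not defined", code.trim()))
    })
}
```

Calling `exec_assignment("a = 5")` and later `eval_name("a")` from two separate entry points reads back the stored value, which is the behavior a statement-aware browser interface would need.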

The End

I’ve been talking long enough, so I’m going to hold off on discussing embedded Python until next Monday.

Apologies for the bait and switch. The content calendar waits for no one.

To be clear, by embedded Python, I mean embedding a CPython interpreter inside of Memphis, not running Python in an “embedded systems” environment. That would be hard for no reason. Unlike Memphis, which is hard for FUN.

Introducing: From Scratch Code

Mon, 04 Nov 2024 00:00:00 GMT

THE BIG CITY—From Scratch Enterprises LLC (ticker: FSE) announced its newest venture Monday, From Scratch Code (ticker: FSC). Members of the media gathered around the folding chair of its owlish founder, Jones Beach. Refreshments were not provided.

Whispers circulated among the media contingent that this was the same desk which produced the not-a-non-profit, From Scratch Press (ticker: FSP). The representative present could not confirm and barely glanced up from their phone.

“After becoming the market leader in telling autism stories no one asked for, we stepped back and asked ourselves what was next,” said Beach. “It became clear that we could go beyond the abcs and move into 1s and 0s.”

The event continued with a personal statement read aloud by Beach, which was a weird format, but seemed heartfelt.

I never set out to make the killer app. When I was building an early project (a website about stadiums) and people asked me when they could expect an app, I looked at them feeling under-appreciated and directed them to my clumsy website. I’m not motivated by attempting to build the next big thing, but by creating something genuine and functional.

My skill set as a software engineer is typically valued through monetization rather than words of affirmation. I’m not asking for sympathy about this; I’m incredibly fortunate that people chose to pay me to write code for them for nearly a decade. But when this system stopped working for me, I looked around for what else I could do with my skills. I’m driven by curiosity and a genuine desire to support others, bringing humor and understanding to my work. These values give me far more satisfaction than fitting into the small box an employer needs each quarter, so I set out to build a business that embraces them, with as little BS as possible.

What mentally freed me to arrive at this point was letting go of the need to impress people who didn’t understand my tools, my craft, or my skill set. While that path works for some, it left me feeling unheard and used. Instead of building software to do something interesting, I chose to build software that is, itself, interesting—a kind of art for art’s sake and my personal rebellion against a system that seeks to control my time and monetize my output.

I landed on a service business because I crave 1:1 connection. My favorite moments in my 9-5 weren’t building products but nerding out with a colleague over an obscure programming language feature. Tutoring proved I could make non-zero dollars doing what I love most. Today, I’m expanding this into my own brand and platform, where creativity and emphasis on the individual can shine—free from high platform fees and other external constraints.

From Scratch Code is for people who already know how to write code and want to learn how to write even better code. Who want to build their own libraries in Rust and Python and understand how programming languages and computers work together under the hood. Who want to have a technical support system which takes not being serious very seriously. I’ll continue to work with students and beginner developers on Wyzant, but here, you’ll find a space where creativity and curiosity are the main drivers.

If any of this resonates with you, I encourage you to sign up for my email list. There, I’ll be telling silly stories about the Rust and Python code I’m building—like my current interpreter project—and the things we could learn together, either through mentorship or courses. I’ll continue to discuss the mental health and adult-diagnosed autism side of my story on From Scratch Press. I can’t think of a better way to fully present the two sides of myself to the modern economy than by maintaining parallel newsletters!

I’ll wrap up with this: my inability to form even a single sentence of what feels like BS played a large role in my decision to leave corporate, so everything I’m building with From Scratch Code is genuine and designed to help you thrive in your technical work. On the flip side, I’m a firm believer that humor and creative absurdism can make people smile and expand what’s considered possible. As such, I found myself with no patience for people who (or systems which encourage people to) say “I’m working with person A on project B” when everyone in the room knows person A doesn’t respond to emails and project B will be scrapped. Perhaps this impatience with non-reality is an autism thing. I’m drawn to the person who says “what if we built project C on the moon?!” Those are the people stretching boundaries and refusing to live in the small boxes corporate life often imposes.

I want to be that person for you, your career, and your technical work. Your unlicensed technical therapist. A supportive listener who doesn’t take insurance but can debug your code.

Feel free to share this message with anyone who might enjoy some offbeat creativity alongside their technical growth—I’d love to connect with them!

This is going to be fun. I hope you’ll join me!

The event concluded ten minutes after it began.

James Beach covers culture and satire in The Big City. He lives in The Big City alongside the rest of the literati. He is in no way related to Jones Beach and believes refreshments at press conferences pose an ethical dilemma.

This is cross-posted on From Scratch Press.

A REPL for fat-finger friendly typing

Mon, 21 Oct 2024 00:00:00 GMT

My Python interpreter, Memphis, has a REPL (read-eval-print loop)!

This is old news. As long as you made zero mistakes while interacting with the wise old owl 🦉, you could interpret to your heart’s content. Assuming you never wanted to evaluate the same statement twice, or if you did, didn’t mind retyping it. Also with zero mistakes.

I was perfectly content with this REPL. Thrilled even. I had written home about this REPL. But my bosses’s bosses’ bossi demanded we improve the REPL for the bottom line for the people. They called me into their office and nodded me into the chair across from their mahogany desk. “Some users are expecting the backspace key to work.” TO HELL WITH THE USERS! “The up arrow should bring up their last command.” ARROW KEY SUPPORT IS SO NINETIES! “Can you have this done by the end of Q5?” I QUIT!

So I went back to my desk and improved the REPL.

I improved it so much that all the keys worked. The backspace key, the up arrow, the down arrow, the left arrow, and last, but not least, the backspace arrow. An accountant could have a field day with the new REPL. tick tick tick tick tick tick tick tick tick. That’s the accountant typing numbers, not a bomb slowly defusing.

I sent the REPL down to the lab and told my main machinist to put a rush job on this order. It’s the REPL I said and from the look in their eyes I could tell they understood. 750ms later the build was complete and we had arrow key support. I took the product back to the big wigs, begged for my job back, and asked them what they thought. They ran a few commands, printed some prints, and added some adds. They made a mistake and hit the backspace key. I rolled my eyes because seriously who makes mistakes but they seemed satisfied. They realized they didn’t want to run a long command they had already typed out and this is where my life went to hell in a handbasket. They. hit. Ctrl. C. Seriously, who does that?! You know that ends the current process, right? RIGHT???

“We need Ctrl-C support by the end of next year.” These people and their demands. I would add Ctrl-C support. But it absolutely would not be within the next two years.

So I went back to my desk and added Ctrl-C support.

What made this REPL worthy of people with fat-fingered tendencies?

Would I be a tool?

I have staked my entire professional and financial future on building things “from scratch,” so I faced a quandary on day 1 of this project. I chose to use crossterm for the key detection primarily because of the cross-platform support. Honestly though, crossterm was very, very good. The API is intuitive and I was especially pleased with KeyModifiers (which we needed to handle Ctrl-C, which I thought was unnecessary, see above).

Raw mode is a pain

We needed it so that the terminal wouldn't handle special keys for us. But damn, I didn't realize it would turn our screen into a malfunctioning typewriter. Anyway, I had to normalize all strings to add a carriage return before any newline characters. Which worked fine and I'm THRILLED about it.

/// When the terminal is in raw mode, we must emit a carriage return in addition to a newline,
/// because that does not happen automatically.
fn normalize<T: Display>(val: T) -> String {
    let formatted = format!("{}", val);
    if terminal::is_raw_mode_enabled().expect("Failed to query terminal raw mode") {
        // Put the carriage return before each newline so the cursor lands in column 0.
        formatted.replace("\n", "\r\n")
    } else {
        formatted
    }
}

/// Print command which will normalize newlines + carriage returns before printing.
fn print_raw<T: Display>(val: T) {
    print!("{}", normalize(val));
    io::stdout().flush().expect("Failed to flush stdout");
}

Integration testing was fun

Under my old REPL (which I preferred, see above), I could test it integration-ally by just running the binary and passing in some Python code to stdin. That stopped working when using crossterm, I think because of a contract dispute. I honestly can’t explain it fully, but event::read() would time out and fail in the integration test when provided with stdin input. So I mocked it.

pub trait TerminalIO {
    fn read_event(&mut self) -> Result<Event, io::Error>;
    fn write<T: Display>(&mut self, output: T) -> io::Result<()>;
    fn writeln<T: Display>(&mut self, output: T) -> io::Result<()>;
}

/// A mock for testing that doesn't use `crossterm`.
struct MockTerminalIO {
    /// Predefined events for testing
    events: Vec<Event>,

    /// Captured output for assertions
    output: Vec<String>,
}

impl TerminalIO for MockTerminalIO {
    fn read_event(&mut self) -> Result<Event, io::Error> {
        if self.events.is_empty() {
            Err(io::Error::new(io::ErrorKind::Other, "No more events"))
        } else {
            // remove from the front (semantically similar to VecDeque::pop_front).
            Ok(self.events.remove(0))
        }
    }

    fn write<T: Display>(&mut self, output: T) -> io::Result<()> {
        self.output.push(format!("{}", output));
        Ok(())
    }

    fn writeln<T: Display>(&mut self, output: T) -> io::Result<()> {
        self.write(output)?;
        self.write("\n")?;
        Ok(())
    }
}

Which resulted in the whole thing becoming a unit test? Honestly I don’t know. At this point, I call it an integration test if I either a) call a binary inside another binary, or 2) launch a server / open a port / listen on a socket inside a test. If you have another definition you’d like to leave in the comments, please don’t because that sounds annoying TBH.

/// Run the complete flow, from input code string to return value string. If you need any Ctrl
/// modifiers, do not use this!
fn run_and_return(input: &str) -> String {
    let mut terminal = MockTerminalIO::from_str(input);
    Repl::new().run(&mut terminal);
    terminal.return_val()
}

fn string_to_events(input: &str) -> Vec<Event> {
    input
        .chars()
        .map(|c| {
            let key_code = match c {
                '\n' => KeyCode::Enter,
                _ => KeyCode::Char(c),
            };
            Event::Key(KeyEvent::new(key_code, KeyModifiers::NONE))
        })
        .collect()
}

We can now test these common scenarios with fairly little boilerplate.

#[test]
fn test_repl_name_error() {
    let return_val = run_and_return("e\n");
    assert!(return_val.contains("NameError: name 'e' is not defined"));
}

#[test]
fn test_repl_expr() {
    let return_val = run_and_return("12345\n");
    assert_eq!(return_val, "12345");
}

#[test]
fn test_repl_statement() {
    let return_val = run_and_return("a = 5.5\n");

    // empty string because a statement does not have a return value
    assert_eq!(return_val, "");
}

#[test]
fn test_repl_function() {
    let code = r#"
def foo():
    a = 10
    return 2 * a

foo()
"#;
    let return_val = run_and_return(code);
    assert_eq!(return_val, "20");
}

#[test]
fn test_repl_ctrl_c() {
    let mut events = string_to_events("123456789\n");
    let ctrl_c = Event::Key(KeyEvent::new(KeyCode::Char('c'), KeyModifiers::CONTROL));
    events.insert(4, ctrl_c);
    let mut terminal = MockTerminalIO::new(events);

    Repl::new().run(&mut terminal);
    assert_eq!(terminal.return_val(), "56789");
}
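The mock above removes events from the front of a Vec, which shifts every remaining element each time. A variant built on VecDeque is sketched below. It is self-contained, so `Event` here is a hypothetical stand-in for crossterm's event type, and `QueueTerminal` and `from_input` are made-up names rather than anything in Memphis.

```rust
use std::collections::VecDeque;
use std::io;

/// Stand-in for `crossterm::event::Event`, reduced to what this sketch needs.
#[derive(Debug, Clone, PartialEq)]
enum Event {
    Char(char),
    Enter,
}

/// Variant of the mock terminal that stores its scripted events in a `VecDeque`.
struct QueueTerminal {
    /// Predefined events for testing
    events: VecDeque<Event>,
    /// Captured output for assertions
    output: Vec<String>,
}

impl QueueTerminal {
    /// Build the event queue from a string, mapping '\n' to `Enter`.
    fn from_input(input: &str) -> Self {
        let events = input
            .chars()
            .map(|c| if c == '\n' { Event::Enter } else { Event::Char(c) })
            .collect();
        Self { events, output: Vec::new() }
    }

    /// Pop the next event from the front, or fail once the script is exhausted.
    fn read_event(&mut self) -> io::Result<Event> {
        self.events
            .pop_front()
            .ok_or_else(|| io::Error::new(io::ErrorKind::Other, "No more events"))
    }

    /// Record output instead of printing it.
    fn write(&mut self, text: &str) {
        self.output.push(text.to_string());
    }
}
```

For scripts of a few dozen keystrokes the difference is academic, so this is more about saying what we mean than about speed.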

Code entrypoints get me out of bed in the morning

One of my motivations in adding a REPL at all was that I believe you make your code better when you add a second entrypoint. You are essentially becoming the second user for your library, which helps you get closer to understanding The One Perfect Abstraction we are all poking at our keyboards in search of. I mean this point earnestly.

“Zero Dependencies” 😉

The REPL is now behind a feature flag as a way to get back at management. I am keeping alive the ability to interpret Python code with the help of zero third-party crates, which means crossterm would either need to be an exception or I would introduce a feature flag. Now, if you compile without the REPL enabled and run “memphis”, it will politely tell you “wrong build, dumbass.”

Goodbye

The REPL is here. You can run it like this. If you want to buy it, that sounds like a scam. Be well & Talk soon.

Declarative macro magic from Axum in Rust

Mon, 07 Oct 2024 00:00:00 GMT

Take a quick glance at the code snippet below. Without thinking too hard, is get a function or a method? How about post?

Router::new()
    .route("/one", get(get_handler).post(post_handler))
    .route("/two", post(post_handler).get(get_handler))

What did you decide: is get a function or a method? It appears to be both! And the same with post!

If you are familiar with API development in Rust, you may recognize this syntax from Axum. This puzzling question piqued my curiosity and led me to build Cairo, where I re-implemented some key Axum concepts in a simpler way as a learning exercise. I’ve long been fascinated by the intermediate + advanced concepts library authors employ to make their interfaces feel frictionless. Rust sits in a brilliant space because of the way it melds high-level expressiveness with low-level control and performance. One of the ways it accomplishes this is using macros.

What is a macro in Rust?

Macros in Rust come in two key forms: declarative and procedural. This post will focus on declarative, but let’s briefly define both.

A declarative macro in Rust uses the function!(..) syntax, which you may be familiar with from print!("Hello World!") or panic!("at the disco!"). It is a form of metaprogramming which allows developers to reduce boilerplate code. It does this by evaluating the declarative macro at compile time, which produces more Rust code which ultimately ends up in the binary of the program.

If you have wondered why print!(..) in Rust is a macro, it is due to its dynamic nature. All of these are valid Rust:

println!("Hello");
println!("Hello {}", "World");
println!("Hello {}{}", "World", "!");

In a ~~language with rules~~ strongly-typed language, a traditional function would typically not support a variable number of parameters like this (also called variable-arity). However, Rust’s declarative macros use a $(..),* syntax to support variable-arity by expanding to different code based on the number of parameters passed. The compiled code may end up calling different function implementations or generating specific code for each unique pattern of parameters.
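Here is a small sketch of that $(..),* repetition in action. The `join_words!` macro is a made-up example, not something from Axum or the standard library. It accepts zero or more comma-separated expressions and expands to code that formats each one.

```rust
/// Hypothetical macro: joins any number of displayable arguments with spaces.
macro_rules! join_words {
    // `$(...),*` matches zero or more comma-separated expressions;
    // `$(,)?` additionally tolerates one trailing comma.
    ($($word:expr),* $(,)?) => {{
        // The repetition expands to one `format!` call per argument.
        let words: Vec<String> = vec![$(format!("{}", $word)),*];
        words.join(" ")
    }};
}
```

Each call site expands separately at compile time, so join_words!("Hello") and join_words!("Hello", "World", 42) produce different code (one format! call versus three), which is how a single macro name covers every arity.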

A procedural macro in Rust uses the #[...] syntax, which you may have seen in #[tokio::main]. This macro is applied to a Rust function or struct, allowing the macro to modify the syntax tree of the annotated item at compile time. While it often helps reduce boilerplate code, it does so by transforming the function or struct as a whole rather than directly inserting statements into the function body. For example, #[tokio::main] wraps the entire main function in a tokio async runtime, enabling it to block on the execution of async code in main.

Code without macros can be repetitive and say the same thing in multiple ways

Axum is one of the most popular web frameworks in Rust. Its compatibility with the Tokio ecosystem and its powerful syntax, among other features, keep it near the front of the pack.

Skipping back to my original bafflement at whether get is a function or a method, the answer is both and it does this using a declarative macro.

If we look at our snippet of interest, get must be a function available in the outer scope and a method on whatever type is returned by invoking post. Similarly, post must be a function available in the outer scope and a method on whatever type is returned by invoking get. This symmetry pleases me. Let’s write some code that accomplishes exactly that and nothing more.

use std::collections::HashMap;

/// Define a trait with a single method `handle`.
/// This trait allows different types to implement handler logic, which is useful in API routing
/// systems.
trait Handler {
    fn handle(&self);
}

/// Define an enum representing HTTP methods.
/// Deriving `Hash`, `Eq`, and `PartialEq` allows this type to be used as a `HashMap` key.
#[derive(Hash, Eq, PartialEq, Debug)]
enum Method {
    Get,
    Post,
}

/// Start of a method-chaining function for the `GET` HTTP method.
fn get<H: Handler + 'static>(handler: H) -> InnerType {
    InnerType::new().on(Method::Get, handler)
}

/// Start of a method-chaining function for the `POST` HTTP method.
fn post<H: Handler + 'static>(handler: H) -> InnerType {
    InnerType::new().on(Method::Post, handler)
}

/// Define a struct that holds a collection of routes. We give it a silly name here to emphasize
/// this is internal to our Axum-like library. Specifically, each `Router` would hold several of
/// these struct instances, each mapped to a specific string-pattern representing a route. This is
/// outside the scope of this example, please see [Cairo](https://github.com/JonesBeach/cairo) if
/// you are interested to learn more.
struct InnerType {
    /// `Box<dyn Handler>` enables dynamic dispatch, allowing for different handler types in the
    /// `HashMap`.
    routes: HashMap<Method, Box<dyn Handler>>,
}

impl InnerType {
    /// Initialize our empty instance.
    fn new() -> Self {
        Self {
            routes: HashMap::default(),
        }
    }

    /// Add a `Handler` for a specific HTTP method to the routes map.
    fn on<H: Handler + 'static>(mut self, method: Method, handler: H) -> Self {
        self.routes.insert(method, Box::new(handler));
        self
    }

    /// Method-chaining function for the `GET` HTTP method.
    fn get<H: Handler + 'static>(self, handler: H) -> Self {
        self.on(Method::Get, handler)
    }

    /// Method-chaining function for the `POST` HTTP method.
    fn post<H: Handler + 'static>(self, handler: H) -> Self {
        self.on(Method::Post, handler)
    }
}

At this point, we see some duplication, but the boilerplate isn’t overwhelming yet. However, what about when we add support for the remaining HTTP verbs: OPTIONS, HEAD, PUT, DELETE, PATCH? We’d find ourselves maintaining 7 functions and 7 methods. Given the choice between maintaining 14 blocks of code versus adding complexity to prove I know a language, I'll choose the latter every single time. Enter declarative macros.

Declarative Macros in Axum

The code below is a mix of Axum and Cairo. We introduce two macros: add_http_function and add_http_method.

/// A declarative macro which will define a function `$name` which accepts a `Handler` to be
/// registered to a particular HTTP `Method::$method`.
macro_rules! add_http_function {
    (
        $name:ident, $method:ident
    ) => {
        fn $name<H: Handler + 'static>(handler: H) -> InnerType {
            on(Method::$method, handler)
        }
    };
}

/// A declarative macro which will define a method `$name` which accepts `self` and a `Handler` to
/// be registered to a particular HTTP `Method::$method`.
macro_rules! add_http_method {
    (
        $name:ident, $method:ident
    ) => {
        fn $name<H: Handler + 'static>(self, handler: H) -> Self {
            self.on(Method::$method, handler)
        }
    };
}

You'll notice that these both accept two ident parameters, a $name and a $method. The $name becomes the function name, which will be get, post, etc., while the $method is concatenated to Method::, meaning we must give a valid Method enum variant. It's okay if this is confusing at first—metaprogramming is a different way of thinking about programming. Instead of asking "what should my code do?" we are now asking "what code should my code produce?"

You'll also notice that while they both define a "function" with the name $name, add_http_method accepts a parameter self. This means we must invoke our macro (by calling add_http_method!(get, Get)) in a context which is aware of a self. For us, this will be inside our impl InnerType block.
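To make "what code should my code produce?" concrete, here is a minimal sketch that puts one macro-generated item next to its hand-written equivalent. The names Route, add_function, and add_method are hypothetical stand-ins for this illustration only, not part of Axum or Cairo, and the handler argument is dropped to keep the focus on the expansion.

```rust
/// Simplified stand-in for the HTTP method enum used in this post.
#[derive(Debug, PartialEq)]
enum Method {
    Get,
    Post,
}

/// Hypothetical type that records which methods were chained, in order.
struct Route {
    methods: Vec<Method>,
}

/// Generates a free function `$name` that starts a chain with `Method::$method`.
macro_rules! add_function {
    ($name:ident, $method:ident) => {
        fn $name() -> Route {
            Route { methods: vec![Method::$method] }
        }
    };
}

/// Generates a method `$name` that appends `Method::$method` to the chain.
/// It takes `self`, so it only makes sense when invoked inside an `impl` block.
macro_rules! add_method {
    ($name:ident, $method:ident) => {
        fn $name(mut self) -> Self {
            self.methods.push(Method::$method);
            self
        }
    };
}

// Macro-generated free function.
add_function!(get, Get);

// Hand-written equivalent of `add_function!(post, Post);`.
fn post() -> Route {
    Route { methods: vec![Method::Post] }
}

impl Route {
    // Macro-generated method.
    add_method!(get, Get);

    // Hand-written equivalent of `add_method!(post, Post);`.
    fn post(mut self) -> Self {
        self.methods.push(Method::Post);
        self
    }
}
```

With these definitions, get().post() and post().get() both compile, and a caller cannot tell which of the four items came from a macro and which was typed by hand.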

Putting it all together

Here is our new library with minimal boilerplate and support for method chaining for 7 HTTP methods.

use std::collections::HashMap;

/// Define a trait with a single method `handle`.
/// This trait allows different types to implement handler logic, which is useful in API routing
/// systems.
trait Handler {
    fn handle(&self);
}

/// Define an enum representing HTTP methods.
/// Deriving `Hash`, `Eq`, and `PartialEq` allows this type to be used as a `HashMap` key.
#[derive(Hash, Eq, PartialEq, Debug)]
enum Method {
    Get,
    Post,
    Options,
    Head,
    Put,
    Delete,
    Patch,
}

/// Define a function to be called by our `add_http_function` macro. For a given HTTP method, this
/// function will register a handler and return a type which supports method chaining.
fn on<H: Handler + 'static>(method: Method, handler: H) -> InnerType {
    InnerType::new().on(method, handler)
}

/// A declarative macro which will define a function `$name` which accepts a `Handler` to be
/// registered to a particular HTTP `Method::$method`.
macro_rules! add_http_function {
    (
        $name:ident, $method:ident
    ) => {
        fn $name<H: Handler + 'static>(handler: H) -> InnerType {
            on(Method::$method, handler)
        }
    };
}

// Invoke our declarative macro once for each HTTP `Method`.
add_http_function!(get, Get);
add_http_function!(post, Post);
add_http_function!(delete, Delete);
add_http_function!(head, Head);
add_http_function!(options, Options);
add_http_function!(patch, Patch);
add_http_function!(put, Put);

/// A declarative macro which will define a method `$name` which accepts `self` and a `Handler` to
/// be registered to a particular HTTP `Method::$method`.
macro_rules! add_http_method {
    (
        $name:ident, $method:ident
    ) => {
        fn $name<H: Handler + 'static>(self, handler: H) -> Self {
            self.on(Method::$method, handler)
        }
    };
}

/// Define a struct that holds a collection of routes. We give it a silly name here to emphasize
/// this is internal to our Axum-like library. Specifically, each `Router` would hold several of
/// these struct instances, each mapped to a specific string-pattern representing a route. This is
/// outside the scope of this example, please see [Cairo](https://github.com/JonesBeach/cairo) if
/// you are interested to learn more.
struct InnerType {
    /// `Box<dyn Handler>` enables dynamic dispatch, allowing for different handler types in the
    /// `HashMap`.
    routes: HashMap<Method, Box<dyn Handler>>,
}

impl InnerType {
    /// Initialize our empty instance.
    fn new() -> Self {
        Self {
            routes: HashMap::default(),
        }
    }

    /// Add a `Handler` for a specific HTTP method to the routes map. This will be invoked from our
    /// `add_http_method` macro.
    fn on<H: Handler + 'static>(mut self, method: Method, handler: H) -> Self {
        self.routes.insert(method, Box::new(handler));
        self
    }

    // Invoke our declarative macro once for each HTTP `Method`.
    add_http_method!(get, Get);
    add_http_method!(post, Post);
    add_http_method!(delete, Delete);
    add_http_method!(head, Head);
    add_http_method!(options, Options);
    add_http_method!(patch, Patch);
    add_http_method!(put, Put);
}
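As a usage sketch, the snippet below condenses the library above to two HTTP methods and chains them the way the original Axum snippet does. The Hello handler and the build_routes helper are hypothetical names introduced only for this illustration. The rest mirrors the listing above.

```rust
use std::collections::HashMap;

trait Handler {
    fn handle(&self);
}

#[derive(Hash, Eq, PartialEq, Debug)]
enum Method {
    Get,
    Post,
}

struct InnerType {
    routes: HashMap<Method, Box<dyn Handler>>,
}

/// Free function used by `add_http_function` to start a chain.
fn on<H: Handler + 'static>(method: Method, handler: H) -> InnerType {
    InnerType::new().on(method, handler)
}

macro_rules! add_http_function {
    ($name:ident, $method:ident) => {
        fn $name<H: Handler + 'static>(handler: H) -> InnerType {
            on(Method::$method, handler)
        }
    };
}

macro_rules! add_http_method {
    ($name:ident, $method:ident) => {
        fn $name<H: Handler + 'static>(self, handler: H) -> Self {
            self.on(Method::$method, handler)
        }
    };
}

// Only two verbs here to keep the sketch short.
add_http_function!(get, Get);
add_http_function!(post, Post);

impl InnerType {
    fn new() -> Self {
        Self {
            routes: HashMap::default(),
        }
    }

    fn on<H: Handler + 'static>(mut self, method: Method, handler: H) -> Self {
        self.routes.insert(method, Box::new(handler));
        self
    }

    add_http_method!(get, Get);
    add_http_method!(post, Post);
}

/// Hypothetical handler used only for this example.
struct Hello;

impl Handler for Hello {
    fn handle(&self) {
        println!("hello");
    }
}

/// Hypothetical helper: `get` is called as a function, `post` as a method.
fn build_routes() -> InnerType {
    get(Hello).post(Hello)
}
```

The order can be flipped to post(Hello).get(Hello) and the result holds the same two routes, which is the symmetry described earlier.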

The End

While we've only scratched the surface of the metapossibilities of Rust's declarative macros, I hope this post has inspired you to explore more of what Rust can do at compile time. Procedural macros are another wonderful beast which I would encourage you to dig into if you are interested in ASTs and language development.

If you are curious about more ways Axum and APIs work under the hood, I encourage you to check out my course HTTP Server in Rust. The final module is public as the repo Cairo, which shows the north star you will build towards throughout the course. In addition to the macro usage we described here, you’ll explore the details of HTTP and how to create a Router which accepts handlers with variable parameters and return types using advanced type erasure.

Now it's your turn! I'd love your perspective in the comments on these two questions:

How have you used declarative macros in your own Rust projects to reduce boilerplate code?
Are there any Rust crates you have used with an interface that makes you go "how did they do that?!"
