# Miscellaneous notes

Here you'll find my miscellaneous, mostly-unsorted notes on various topics.

# Project ideas

Various ideas for projects that I do not yet have the time, knowledge or energy to work on. Feel free to take these ideas if they seem interesting, though please keep them non-commercial and free of ads!

# Automatic authentication keys

**Problem:** Every website needs you to create an account. This is a pain to manage, and a barrier. This is especially problematic for self-hosted things like Forgejo, because it gives centralized platforms an advantage (everyone already has an account there). It should be trivial to immediately start using a site without going through a registration process.

**Other solutions:** OIDC, OpenID and such all require you to have an account with a provider. You either have to fully trust this provider with your access, or you need to self-host it, which is extra work. Passkeys are extremely Google-shaped and dubiously designed and documented. Federation is a massively complex solution to design for, and really an unnecessary complexity expense for the vast majority of self-hosted cases.

**Proposed solution:** Authentication directly integrated into browser through a browser extension. It uses request interception APIs and such to detect "is supported" headers from websites, and inject authentication headers into requests upon confirmation from the user that they wish to authenticate (it should not disclose its existence before that point). Authentication is done through keys managed locally by the browser and optionally stored encrypted on a third-party server.

**Unsolved issues:** Key management and backup, making it robust. Offer to backup to a USB key? How to deal with Manifest v3 in Chrome?

# Javascript

Anything about Javascript in general, that isn't specific to Node.js.

# Whirlwind tour of (correct) npm usage

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/9b9dbd8c9ac3b55a65b2](https://gist.github.com/joepie91/9b9dbd8c9ac3b55a65b2). </p>

This is a quick tour of how to get started with NPM, how to use it, and how to fix it.

<p class="callout success">I'm available for [tutoring and code review](http://cryto.net/~joepie91/code-review.html) :)</p>

### Starting a new project

Create a folder for your project, preferably a Git repository. Navigate into that folder, and run:

```bash
npm init
```

It will ask you a few questions. Hit `Enter` without input if you're not sure about a question, and it will use the default.

You now have a `package.json`.

<p class="callout warning">*If you're using Express:* Please don't use `express-generator`. It sucks. Just use `npm init` like explained above, and follow the 'Getting Started' and 'Guide' sections on the [Express website](http://expressjs.com/). They will teach you all you need to know when starting from scratch.</p>

### Installing a package

All packages in NPM are *local* - that is, specific to the project you install them in, and actually installed *within* that project. They are also nested - if you use the `foo` module, and `foo` uses the `bar` module, then you will have a `./node_modules/foo/node_modules/bar`. This means you pretty much never have version conflicts, and can install as many modules as you want without running into issues.

All modern versions of NPM will 'deduplicate' and 'flatten' your module folder as much as possible to save disk space, but as a developer you don't have to care about this - it will still work like it's a tree of nested modules, and you can still assume that there will be no version conflicts.

You install a package like this:

```bash
npm install packagename
```

While the packages themselves are installed in the `node_modules` directory (as that's where the Node.js runtime will look for them), that's only a temporary install location. The *primary* place where your dependencies are defined should be your `package.json` file - so that they can be safely updated and reinstalled later, even if your `node_modules` gets lost or corrupted somehow.

In older versions of npm, you had to manually specify the `--save` flag to make sure that the package is saved in your `package.json`; that's why you may come across this in older articles. However, modern versions of NPM do this automatically, so the command above should be enough.

One case where you *do* still need to use a flag, is when you're installing a module that you just need for developing your project, but that isn't needed when actually *using* or *deploying* your project. Then you can use the `--save-dev` flag, like so:

```bash
npm install --save-dev packagename
```

This works pretty much the same, but saves it as a development dependency. This allows a user to install just the 'real' dependencies, to save space and bandwidth, if they just want to use your thing and not modify it.
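After installing both kinds of dependencies, your `package.json` will contain something like the following (the package names and versions here are just examples):

```json
{
  "name": "my-project",
  "version": "1.0.0",
  "dependencies": {
    "express": "^4.18.2"
  },
  "devDependencies": {
    "eslint": "^8.40.0"
  }
}
```

Running `npm install` without flags installs both sections; to skip the `devDependencies`, older npm versions use `npm install --production`, newer ones `npm install --omit=dev`.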

To install everything that is declared in `package.json`, you just run it without arguments:

```bash
npm install
```

When you're using Git or another version control system, you should add `node_modules` to your ignore file (eg. `.gitignore` for Git); this is because *installed* copies of modules may need to be different depending on the system. You can then use the above command to make sure that all the dependencies are correctly installed, after cloning your repository to a new system.

### Semantic versioning

Packages in NPM usually use semantic versioning; that is, the changes in a version number indicate what has changed, and whether the change is breaking. Let's take **1.2.3** as an example version. The components of that version number would be:

- **Major version number:** 1
- **Minor version number:** 2
- **Patch version number:** 3

Depending on which number changes, there's a different kind of change to the module:

- **Patch version upgrade (eg. `1.2.3` -&gt; `1.2.4`):** An internal change was made, but the API hasn't changed. It's safe to upgrade.
- **Minor version upgrade (eg. `1.2.3` -&gt; `1.3.0`):** The API has changed, but in a backwards-compatible manner - for example, a new feature or option was added. It's safe to upgrade. You may still want to read the changelog, in case there's new features that you want to use, or that you were waiting for.
- **Major version upgrade (eg. `1.2.3` -&gt; `2.0.0`):** The API has changed, and is not backwards-compatible. For example, a feature was removed, a default was changed, and so on. It is **not** safe to upgrade. You first need to read the changelog, to see whether the changes affect your application.

Most NPM packages follow this, and it gives you a lot of certainty in what upgrades are safe to carry out, and what upgrades aren't. NPM explicitly adopts semver in its package.json as well, by introducing a few special version formats:

- **`~1.2.3`:** Allow automatic patch upgrades, but not minor or major upgrades. Upgrading to `1.2.4` is allowed, but upgrading to `1.3.0` or `2.0.0` is not. You still can't downgrade below `1.2.3` - for example, `1.2.2` is *not* allowed.
- **`^1.2.3`:** Allow automatic patch and minor upgrades, but not major upgrades. Upgrading to `1.2.4` or `1.3.0` is allowed, but upgrading to `2.0.0` is not. You still can't downgrade below `1.2.3` - for example, `1.2.2` or `1.1.0` are *not* allowed.
- **`1.2.3`:** Require this specific version. No upgrades are allowed. You will rarely need this - only for misbehaving packages, really.
- **`*`:** Allow upgrades to whatever the latest version is. You should **never** use this.

By default, NPM will automatically use the **^1.2.3** notation, which is usually what you want. Only configure it otherwise if you have an explicit reason to do so.

A special case is `0.x.x` versions - these are considered to be 'unstable', and the rules are slightly different: the *minor* version number indicates a breaking change, rather than the major version number. That means that `^0.1.2` will allow an upgrade to `0.1.3`, but *not* to `0.2.0`. This is commonly used for pre-release testing versions, where things may wildly change with every release.
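To make the `^` rules concrete, here's a simplified sketch of how a caret range could be evaluated. Real tools use the `semver` package for this; this toy version ignores prerelease tags and assumes plain `x.y.z` strings:

```javascript
// Simplified sketch of how a caret (^) range could be evaluated.
// Real tools use the `semver` package; this toy version ignores
// prerelease tags and assumes plain "x.y.z" strings.
function satisfiesCaret(version, base) {
	const [vMajor, vMinor, vPatch] = version.split(".").map(Number);
	const [bMajor, bMinor, bPatch] = base.split(".").map(Number);

	// Major versions must match; for 0.x.x versions, the *minor*
	// number is the breaking one, so it must match too.
	if (vMajor !== bMajor) return false;
	if (bMajor === 0 && vMinor !== bMinor) return false;

	// No downgrades below the base version.
	if (vMinor < bMinor) return false;
	if (vMinor === bMinor && vPatch < bPatch) return false;

	return true;
}

console.log(satisfiesCaret("1.3.0", "1.2.3")); // true: minor upgrade allowed
console.log(satisfiesCaret("2.0.0", "1.2.3")); // false: major upgrade
console.log(satisfiesCaret("0.2.0", "0.1.2")); // false: 0.x minor is breaking
```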

If you end up publishing a module yourself (and you most likely eventually will), then definitely adhere to these guidelines as well. They make it a lot easier for developers to keep dependencies up to date, leading to considerably fewer bugs and security issues.

### Global modules

Sometimes, you want to install a command-line utility such as [`peerflix`](https://www.npmjs.com/package/peerflix), but it doesn't belong to any particular project. For this, there's the `--global` or `-g` flag:

```bash
npm install -g peerflix
```

If you used packages from your distribution to install Node, you may have to use `sudo` for global modules.

**Never, ever, ever use global modules for project dependencies, ever.** It may seem 'nice' and 'efficient', but you will land in dependency hell. It is not possible to enforce semver constraints on global modules, and things will spontaneously break. All the time. Don't do it. Global modules are *only for project-independent, system-wide, command-line tools*.

**This applies even to development tools for your project.** Different projects will often need different, incompatible versions of development tools - so those tools should be installed *without* the global flag. For local packages, the binaries are all collected in `node_modules/.bin`. You can then run the tools like so:

```bash
./node_modules/.bin/eslint
```
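A common way to avoid typing that path is to define an npm 'script' in your `package.json` - npm automatically adds `node_modules/.bin` to the `PATH` when running scripts. For example (the script name is up to you):

```json
{
  "scripts": {
    "lint": "eslint ."
  }
}
```

You can then run the locally-installed tool with `npm run lint`.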

### NPM is broken, and I don't understand the error!

The errors that NPM shows are usually not very clear. I've written a tool that will analyze your error, and try to explain it in plain English. It can be found [here](http://cryto.net/why-is-npm-broken/).

### My dependencies are broken!

If you've just updated your Node version, then you may have native (compiled) modules that were built against the old Node version, and that won't work with the new one. Run this to rebuild them:

```bash
npm rebuild
```

### My dependencies are still broken![<svg aria-hidden="true" class="octicon octicon-link" height="16" version="1.1" viewbox="0 0 16 16" width="16"></svg>](https://gist.github.com/joepie91/9b9dbd8c9ac3b55a65b2#my-dependencies-are-still-broken)

Make sure that all your dependencies are declared in `package.json`. Then just remove and recreate your `node_modules`:

```bash
rm -rf node_modules
npm install
```

# An overview of Javascript tooling

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/3381ce7f92dec7a1e622538980c0c43d](https://gist.github.com/joepie91/3381ce7f92dec7a1e622538980c0c43d).</p>

Getting confused about the piles of development tools that people use for Javascript? Here's a quick index of what is used for what.

**Keep in mind that you shouldn't add tools to your workflow for the sake of it.** While you'll see many production systems using a wide range of tools, these tools are typically used because they solved a *concrete problem* for the developers working on it. You should **not** add tools to your project unless you have a concrete problem that they can solve; none of the tools here are *required*.

Start with nothing, and add tools as needed. This will keep you from getting lost in an incomprehensible pile of tooling.

### Build/task runners

**Typical examples:** Gulp, Grunt

These are not exactly build tools in and of themselves; they're rather just used to glue together *other* tools. For example, if you have a set of build steps where you need to run tool A after tool B, a build runner can help to orchestrate those tools.

### Bundlers

**Typical examples:** Browserify, Webpack, Parcel

These tools take a bunch of `.js` files that use [modules](https://nodejs.org/api/modules.html) (either CommonJS using `require()` statements, or ES Modules using `import` statements), and combine them into a *single* `.js` file. Some of them also allow specifying 'transformation steps', but their main purpose is bundling.

Why does bundling matter? While in Node.js you have access to a module system that lets you load files as-needed from disk, this wouldn't be practical in a browser; fetching every file individually over the network would be very slow. That's why people use a bundler, which effectively does all this work upfront, and then produces a single 'combined' file with all the same guarantees of a module system, but that can be used in a browser.

Bundlers can also be useful for running module-using code in very basic JS environments that don't have module support for some reason; this includes Google Sheets, extensions for PostgreSQL, GNOME, and so on.

**Bundlers are *not* transpilers.** They do not compile one language to another, and they don't "make ES6 work everywhere". Those are the job of a *transpiler*. Bundlers are sometimes configured to *use* a transpiler, but the transpiling itself isn't done by the bundler.

**Bundlers are *not* task runners.** This is an especially popular misconception around Webpack. Webpack does *not* replace task runners like Gulp; while Gulp is designed to glue together arbitrary build tasks, Webpack is specifically designed for *browser bundles*. It's commonly useful to use Webpack *with* Gulp or another task runner.

### Transpilers

**Typical examples:** Babel, the TypeScript compiler, CoffeeScript

These tools take a bunch of code in one language, and 'compile' it to another language. They're commonly called 'transpilers' rather than 'compilers' because unlike traditional compilers, these tools don't compile to a lower-level representation; they're just different languages at a similar level of abstraction.

These are typically used to run code written against newer JS versions in older JS runtimes (eg. Babel), or to provide custom languages with more conveniences or constraints that can then be executed in any regular JS environment (TypeScript, CoffeeScript).
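As a rough illustration of what a transpiler does, here's a modern arrow function alongside the kind of ES5 equivalent that a tool like Babel might emit (simplified; actual Babel output differs):

```javascript
// A modern arrow function (ES2015+):
const double = (x) => x * 2;

// Roughly the kind of ES5 output a transpiler might produce for
// older runtimes (simplified; real transpiler output differs):
var doubleTranspiled = function (x) {
	return x * 2;
};

console.log(double(4), doubleTranspiled(4)); // prints "8 8"
```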

### Process restarters

**Typical examples:** nodemon

These tools automatically restart your (Node.js) process when the underlying code is changed. This is used for development purposes, to remove the need to manually restart your process after every change.

A process restarter may either watch for file changes itself, or be controlled by an external tool like a build runner.

### Page reloaders

**Typical examples:** LiveReload, BrowserSync, Webpack hot-reload

These tools automatically refresh a page in the browser and/or reload stylesheets and/or re-render parts of the page, to reflect the changes in your *browser-side* code. They're kind of the equivalent of a process restarter, but for webpages.

These tools are usually externally controlled; typically by either a build runner or a bundler, or both.

### Debuggers

**Typical examples:** Chrome Developer Tools, node-inspect

These tools allow you to inspect *running* code; in Node.js, in your browser, or both. Typically they'll support things like pausing execution, stepping through function calls manually, inspecting variables, profiling memory allocations and CPU usage, viewing execution logs, and so on.

They're typically used to find tricky bugs. It's a good idea to learn how these tools work, but often it'll still be easier to find a bug by just 'dumb logging' variables throughout your code using eg. `console.log`.

# Monolithic vs. modular - what's the difference?

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/7f03a733a3a72d2396d6](https://gist.github.com/joepie91/7f03a733a3a72d2396d6).</p>

When you're developing in Node.js, you're likely to run into these terms - "monolithic" and "modular". They're usually used to describe the different types of frameworks and libraries; not just HTTP frameworks, but modules in general.

### At a glance

- **Monolithic:** "Batteries-included" and typically tightly coupled, it tries to include all the stuff that's needed for common usecases. An example of a monolithic web framework would be [Sails.js](http://sailsjs.org/).
- **Modular:** "Minimal" and loosely coupled. Only includes the bare minimum of functionality and structure, and the rest is a plugin. Fundamentally, it generally only has a single 'responsibility'. An example of a modular web framework would be [Express](http://expressjs.com/).

### Coupled?

In software development, the terms "tightly coupled" and "loosely coupled" are used to indicate how much components rely on each other; or more specifically, how many assumptions they make about each other. This directly translates to how easy it is to replace and change them.

- **Tightly coupled:** Highly interdependent code, where every part of the code makes assumptions about every other part of the code.
- **Loosely coupled:** Very "separated" code, where every part of the code communicates with other parts through more-or-less standardized and neutral interfaces.

While tight coupling can sometimes result in slightly more performant code and very occasionally makes it easier to build a 'mental model', loosely coupled code is much easier to understand and maintain - as the inner workings of a component are separated from its interface or API, you can make many more assumptions about how it behaves.

Loosely coupled code is often centered around 'events' and data - a component 'emits' changes that occur, with data attached to them, and other components may optionally 'listen' to those events and do something with it. However, the emitting component has no idea who (if anybody!) is listening, and cannot make assumptions about what the data is going to be used for.

What this means in practice, is that loosely coupled (and modular!) code rarely needs to be changed - once it is written, has a well-defined set of events and methods, and is free of bugs, it no longer needs to change. If an application wants to start using the data differently, it doesn't require changes in the component; the data is still of the same format, and the application can simply process it differently.

This is only one example, of course - loose coupling is more of a practice than a pattern. The exact implementation depends on your usecase. A quick checklist to determine how loosely coupled your code is:

- [ ]  **Does your component rely on external state?** This is an absolute no-no. Your component cannot rely on any state outside of the component itself. It may not make any assumptions about the application *whatsoever*. Don't even rely on configuration files or other filesystem files - all such data must be passed in by the application explicitly, always. What isn't in the component itself, doesn't exist.
- [ ]  **How many assumptions does it make about how the result will be used?** Loosely coupled code shouldn't care about how its output will be used, whether it's a return value or an event. The output just needs to be consistent, documented, and neutral.
- [ ]  **How many custom 'types' are used?** Loosely coupled code should generally only accept objects that are defined on a language or runtime level, and in common use. Arrays and A+ promises are fine, for example - a proprietary representation of an ongoing task is not.
- [ ]  **If you *need* a custom type, how simple is it?** If absolutely needed, your custom object type should be as plain as possible - just a plain Javascript object, optimally. It should be well-documented, and not duplicate an existing implementation to represent this kind of data. Ideally, it should be defined in a separate project, just for documenting the type; that way, others can implement it as well.

In this section, I've used the terms "component" and "application", but these are interchangeable with "callee"/"caller", and "provider"/"consumer". The principles remain the same.
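The first checklist item - no reliance on external state - can be illustrated with a made-up component that receives everything it needs explicitly from the application:

```javascript
// Sketch of a component with no external state: everything it needs
// is passed in explicitly by the application. All names here are
// made up for illustration.
function createMailer({ smtpHost, smtpPort }) {
	// No config files, no globals, no environment variables - only
	// the arguments that the application passed in.
	return {
		describe() {
			return `Mailer for ${smtpHost}:${smtpPort}`;
		},
	};
}

// The application is responsible for obtaining the configuration
// (from a file, the environment, etc.) and handing it over.
const mailer = createMailer({ smtpHost: "mail.example.com", smtpPort: 587 });
console.log(mailer.describe()); // prints "Mailer for mail.example.com:587"
```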

### The trade-offs

At first, a monolithic framework might look easier - after all, it already includes everything you think you're going to need. In the long run, however, you're likely to run into situations where the framework just doesn't *quite* work how you want it to, and you have to spend time trying to work around it. This problem gets worse if your usecase is more unusual - because the framework developers didn't keep in mind your usecase - but it's a risk that always exists to some degree.

Initially, a modular framework might look harder - you have to figure out what components to use for yourself. That's a one-time cost, however; the majority of modules are reusable across projects, so after your first project you'll have a good idea of what to start with. The remaining usecase-specific modules would've been just as much of a problem in a monolithic framework, where they likely wouldn't have existed to begin with.

Another consideration is the possibility to 'swap out' components. What if there's a bug in the framework that you're unable (or not allowed) to fix? When building your application modularly, you can simply get rid of the offending component and replace it with a different one; this usually doesn't take more than a few minutes, because components are typically small and only do one thing.

In a monolithic framework, this is more problematic - the component is an inherent part of the framework, and replacing it may be impossible or extremely hard, depending on how many assumptions the framework makes. You will almost certainly end up implementing a workaround of some sort, which can take *hours*; you need to understand the framework's codebase, the component you're using, and the exact reason why it's failing. Then you need to write code that works around it, sometimes even having to 'monkey-patch' framework methods.

Relatedly, you may find out halfway through the project that the framework doesn't support your usecase as well as you thought it would. Now you have to either *replace the entire framework*, or build hacks upon hacks to make it 'work' somehow; well enough to convince your boss or client, anyway. The higher cost for on-boarding new developers (as they have to learn an entire framework, not just the bits you're interested in *right now*), only compounds this problem - now they *also* have to learn why all those workarounds exist.

In summary, the tradeoffs look like this:

- **Monolithic:** Slightly faster to get started with, but less control over its workings, more chance of the framework not supporting your usecase, and higher long-term maintenance cost due to the inevitable need for workarounds.
- **Modular:** Takes slightly longer to get started on your first project, but total control over its workings, practically every usecase is supported, and long-term maintenance is cheaper.

### The "it's just a prototype!" argument

When explaining this to people, a common justification for picking a monolithic framework is that "it's just a prototype!", or "it's just an MVP!", with the implication that it can be changed later. In reality, it usually can't.

Try explaining to your boss that you want to throw out the working(!) code you have, and rewrite everything from the ground up in a different, more maintainable framework. The best response that you're likely to get, is your boss questioning why you didn't use that framework to begin with - but more likely, the answer is "no", and you're going to be stuck with your hard-to-maintain monolithic codebase for the rest of the project or your employment, whichever terminates first.

Again, the cost of a modular codebase is a one-time cost. After your first project, you already know where to find most modules you need, and building on a modular framework will not be more expensive than building on a monolithic one. Don't fall into the "prototype trap", and do it right from day one. You're likely to be stuck with it for the rest of your employment.

# Synchronous vs. asynchronous

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/bf3d04febb024da89e3a3e61b164247d](https://gist.github.com/joepie91/bf3d04febb024da89e3a3e61b164247d). </p>

You'll run into the terms "synchronous" and "asynchronous" a lot when working with JS. Let's look at what they actually *mean*.

Synchronous code is like what you might be used to already from other languages. You call a function, it does some work, and then returns the result. No other code runs in the meantime. This is simple to understand, but it's also inefficient; what if "doing some work" mostly involves getting some data from a database? In the meantime, our process is sitting around doing nothing, waiting for the database to respond. It could be doing useful work in that time!

And that's what brings us to asynchronous code. Asynchronous code works differently; you still call a function, but it *doesn't* return a result. Instead, you don't just pass the regular arguments to the function, but also give it a piece of code in a function (a so-called "asynchronous callback") to execute when the operation completes. The JS runtime stores this callback alongside the in-progress operation, to retrieve and execute it later when the external service (eg. the database) reports that the operation has been completed.

Crucially, this means that when you call an asynchronous function, it *cannot* wait until the external processing is complete before returning from the function! After all, the intention is to keep running other code in the meantime, so it needs to return from the function so that the 'caller' (the code which originally called the function) can continue doing useful things even while the external operation is in progress.

All of this takes place in what's called the "event loop" - you can pretty much think of it as a huge infinite loop that contains your entire program. Every time you trigger an external process through an asynchronous function call, that external process will eventually finish, and put its result in a 'queue' alongside the callback you specified. On each iteration ("tick") of the event loop, it then goes through that queue and executes all of the callbacks; those callbacks can in turn cause new items to be put into the queue, and so on. The end result is a program that calls asynchronous callbacks as and when necessary, and that keeps giving new work to the event loop through a chain of those callbacks.

This is, of course, a very simplified explanation - just enough to understand the rest of this page. I strongly recommend reading up on the event loop more, as it will make it much easier to understand JS in general. Here are some good resources that go into more depth:

1. [https://nodesource.com/blog/understanding-the-nodejs-event-loop](https://nodesource.com/blog/understanding-the-nodejs-event-loop) (article)
2. [https://www.youtube.com/watch?v=8aGhZQkoFbQ](https://www.youtube.com/watch?v=8aGhZQkoFbQ) (video)
3. [https://www.youtube.com/watch?v=cCOL7MC4Pl0](https://www.youtube.com/watch?v=cCOL7MC4Pl0) (video)
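The queue-and-ticks model described above can be sketched in a few lines. This is a toy model only - real event loops (with timers, I/O phases, microtasks, and so on) are far more involved:

```javascript
// A toy model of the event loop: a queue of callbacks, drained one
// tick at a time. Real event loops are far more involved; this only
// shows the queue-and-ticks idea.
const queue = [];

function schedule(callback) {
	queue.push(callback);
}

function runEventLoop() {
	while (queue.length > 0) {
		// Take everything that was queued for this tick...
		const tickCallbacks = queue.splice(0);
		// ... and run it; callbacks may schedule new work for a later tick.
		for (const callback of tickCallbacks) {
			callback();
		}
	}
}

schedule(() => {
	console.log("first tick");
	schedule(() => console.log("second tick"));
});

runEventLoop(); // prints "first tick", then "second tick"
```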

Now that we understand what the event loop is, and what a "tick" is, we can define more precisely what "asynchronous" means in JS:

**Asynchronous code is code that happens across more than one event loop tick. An asynchronous function is a function that needs more than one event loop tick to complete.**

This definition will be important later on, for understanding why asynchronous code can be more difficult to write correctly than synchronous code.

### Asynchronous execution order and boundaries

This idea of "queueing code to run at some later tick" has consequences for how you write your code.

Remember how the event loop is a loop, and ticks are iterations - this means that event loop ticks are *distributed across time linearly*. First the first tick happens, then the second tick, then the third tick, and so on. Something that runs in the first tick can *never* execute before something that runs in the third tick; unless you're a time traveller anyway, in which case you probably would have more important things to do than reading this guide 😃

Anyhow, this means that code will run in a slightly counterintuitive way, if you're used to synchronous code. For example, consider the following code, which uses the asynchronous `setTimeout` function to run something after a specified amount of milliseconds:

```javascript
console.log("one");

setTimeout(() => {
	console.log("two");
}, 300);

console.log("three");
```

You might expect this to print out `one, two, three` - but if you try running this code, you'll see that it doesn't! Instead, you get this:

```
one
three
two
```

What's going on here?!

The answer to that is what I mentioned earlier; the asynchronous callback is *getting queued for later*. Let's pretend for the sake of explanation that an event loop tick only happens when there's actually something to do. The **first tick** would then run this code:

```javascript
console.log("one");

setTimeout(..., 300); // This schedules some code to run in a next tick, about 300ms later

console.log("three");
```

Then 300 milliseconds elapse, with nothing for the event loop to do - and after those 300ms, the callback we gave to `setTimeout` suddenly appears in the event loop queue. Now the **second tick** happens, and it executes this code:

```javascript
console.log("two");
```

... thus resulting in the output that we saw above.

The key insight here is that **code with callbacks does not execute in the order that the code is written**. Only the code *outside* of the callbacks executes in the written order. For example, we can be certain that `three` will get printed after `one` because both are outside of the callback and so they are executed in that order, but because `two` is printed from *inside* of a callback, we can't know when it will execute.

*"But hold on"*, you say, *"then how can you know that `two` will be printed after `three` and `one`?"*

This is where the earlier definition of "asynchronous code" comes into play! Let's reason through it:

1. `setTimeout` is asynchronous.
2. Therefore, we call `console.log("two")` from within an asynchronous callback.
3. Synchronous code executes within one tick.
4. Asynchronous code needs more than one tick to execute, ie. the asynchronous callback will be called in a *later* tick than the one where we started the operation (eg. `setTimeout`).
5. Therefore, an asynchronous callback will *always* execute after the synchronous code that started the operation, no matter what.
6. Therefore, `two` will *always* be printed after `one` and `three`.

So, we can know when the asynchronous callback will be executed, in terms of relative time. That's useful, isn't it? Doesn't that mean that we can do that for *all* asynchronous code? Well, unfortunately not - it gets more complicated when there is *more* than one asynchronous operation.

Take, for example, the following code:

```javascript
console.log("one");

someAsynchronousOperation(() => {
	console.log("two");
});

someOtherAsynchronousOperation(() => {
	console.log("three");
});

console.log("four");
```

We have two different asynchronous operations here, and we don't know for certain which of the two will finish faster. We don't even know whether it's always the *same* one that finishes faster, or whether it varies between runs of the program. So while we can determine that `two` and `three` will *always* be printed after `one` and `four` - remember, asynchronous callbacks always run after the synchronous code that started them - we *can't* know whether `two` or `three` will come first.

And this is, fundamentally, what makes asynchronous code more difficult to write; you never know for sure in what order your code will complete. Every real-world program will have at least *some* scenarios where you can't force an order of operations (or, at least, not without horribly bad performance), so this is a problem that you *have* to account for in your code.

The easiest solution to this is to avoid "shared state". Shared state is information that you store (eg. in a variable) and that gets used by multiple parts of your code independently. This can sometimes be necessary, but it comes at a cost - if function A and function B both modify the same variable, and they run in a different order than you expected, one of them might mess up the state that the other expects. This is true in programming generally, but it's even more important when working with asynchronous code, as your chunks of code get 'interspersed' much more due to the callback model.
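To make this concrete, here's a small self-contained sketch - the timer delays are made up, standing in for real asynchronous operations - of two callbacks touching the same shared array:

```javascript
let results = [];

// Two 'operations' that both modify the shared `results` array. The slow
// one is *started* first, but finishes last:
setTimeout(() => {
	results.push("slow operation");
}, 200);

setTimeout(() => {
	results.push("fast operation");
}, 50);

setTimeout(() => {
	// By now, both callbacks have run - and the 'fast' one came first,
	// despite being started second:
	console.log(results); // [ 'fast operation', 'slow operation' ]
}, 400);
```

If any other code had assumed that `results[0]` would be the result of the first-started operation, it would now silently be working with the wrong data - that's the shared-state hazard in a nutshell.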

\[...\]

# What is state?

<div class="Box-body readme blob p-5 p-xl-6 gist-border-0" id="bkmrk-this-article-was-ori"><article class="markdown-body entry-content container-lg"><p class="callout info">This article was originally published at [https://gist.github.com/joepie91/8c2cba6a3e6d19b275fdff62bef98311](https://gist.github.com/joepie91/8c2cba6a3e6d19b275fdff62bef98311). </p>

"State" is data that is associated with some part of a program, and that can be changed over time to change the behaviour of the program. It doesn't have to be changed by the user; it can be changed by *anything* in the program, and it can be *any* kind of data.

It's a bit of an abstract concept, so here's an example: say you have a button that increases a number by 1 every time you click it, and the (pseudo-)code looks something like this:

```javascript
let counter = 0;
let increment = 1;

button.on("click", () => {
	counter = counter + increment;
});
```

In this code, there are two bits of "state" involved:

1. **Whether the button is clicked:** This bit of data - specifically, the change between "yes" and "no" - is what determines when to increase the counter. The example code doesn't interact with this data directly, but the callback is called whenever it changes from "no" to "yes" and back again.
2. **The current value of the counter:** This bit of data is used to determine what the *next* value of the counter is going to be (the current value plus one), as well as what value to show on the screen.

Now, you may note that we also define an `increment` variable, but that it isn't in the list of things that are "state"; this is because the `increment` value *never changes*. It's just a static value (`1`) that is always the same, even though it's stored in a variable. That means it's *not* state.

You'll also note that "whether the button is clicked" isn't stored in any variable we have access to, and that we can't access the "yes" or "no" value directly. This is an example of what we'll call *invisible state* - data that *is* state, but that we cannot see or access directly - it only exists "behind the scenes". Nevertheless, it still affects the behaviour of the code through the event handler callback that we've defined, and that means it's still state.

</article></div>

# Promises reading list

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/791640557e3e5fd80861](https://gist.github.com/joepie91/791640557e3e5fd80861). </p>

This is a list of examples and articles, in roughly the order you should follow them, to show and explain how promises work and why you should use them. I'll probably add more things to this list over time.

This list primarily focuses on Bluebird, but the basic functionality should also work in ES6 Promises, and some examples are included on how to replicate Bluebird functionality with ES6 promises. You should still use Bluebird where possible, though - it is faster, less error-prone, and has more utilities.

<p class="callout success">I'm available for [tutoring and code review](http://cryto.net/~joepie91/code-review.html) :)</p>

You may reuse all of the referenced posts and Gists (written by me) for any purpose under the [WTFPL](http://www.wtfpl.net/txt/copying/) / [CC0](https://creativecommons.org/publicdomain/zero/1.0/) (whichever you prefer).

### If you get stuck

I've made a [brief FAQ](https://wiki.slightly.tech/books/miscellaneous-notes/page/the-promises-faq-addressing-the-most-common-questions-and-misconceptions-about-promises "The Promises FAQ - addressing the most common questions and misconceptions about Promises") of common questions that people have about Promises, and how to use them. If you don't understand something listed here, or you're wondering how to implement a specific requirement, chances are that it'll be answered in that FAQ.

### Compatibility

Bluebird will **not** work correctly (in client-side code) in older browsers. If you need to support older browsers, and you're using Webpack or Browserify, you should use the [`es6-promise`](https://www.npmjs.com/package/es6-promise) module instead, and reimplement behaviour where necessary.

### Introduction

- Start reading [here](http://bluebirdjs.com/docs/why-promises.html), to understand why Promises matter.
- If it's not quite clear yet, [some code that uses callbacks, and its equivalent using Bluebird](https://gist.github.com/joepie91/c6aa1ee552dcac821d03).
- [A demonstration of how promise chains can be 'flattened'](https://gist.github.com/joepie91/211c8e99fb5a83b76079)

### Promise.try

Many guides and examples fail to demonstrate Promise.try, or to explain why it's important. [This article](http://cryto.net/~joepie91/blog/2016/05/11/what-is-promise-try-and-why-does-it-matter/) will explain it.

### Error handling

- [A quick introduction](https://gist.github.com/joepie91/c8d8cc4e6c2b57889446)
- An illustration of error bubbling: [step 1](https://gist.github.com/joepie91/2b62b735020e51b260abacaa133f48f0), [step 2](https://gist.github.com/joepie91/b0c8f9a9309f5398080eab84482d58a4)
- [Implementing 'fallback' values](https://gist.github.com/joepie91/f6a56acdae303e90e44a) (ie. defaults for when an asynchronous operation fails)
- [bluebird-tap-error](https://www.npmjs.com/package/bluebird-tap-error), a module for intercepting and looking at errors, without preventing propagation. Useful if you need to do the actual error handling elsewhere.
- [Handling errors in Express, using Promises](http://cryto.net/~joepie91/blog/2015/05/14/using-promises-bluebird-with-express/)

Many examples on the internet don't show this, but you should **always** start a chain of promises with Promise.try, and if it is within a function or callback, you should always **return** your promise chain. Not doing so will result in less reliable error handling and various other issues (eg. code executing too soon).
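Both points, sketched with ES6 Promises - a minimal `promiseTry` stand-in is defined inline for illustration; Bluebird's `Promise.try` behaves the same way for this purpose:

```javascript
// Minimal stand-in for Bluebird's Promise.try, for illustration only:
const promiseTry = (func) => new Promise((resolve) => resolve(func()));

function getGreeting(name) {
	// Starting the chain with Promise.try turns a synchronous throw
	// into a rejected Promise...
	return promiseTry(() => {
		if (typeof name !== "string") {
			throw new Error("name must be a string");
		}
		return "hello, " + name;
	});
	// ... and *returning* the chain lets the caller attach its own
	// error handling, instead of the error getting lost.
}

getGreeting(42).catch((err) => {
	console.log("caught:", err.message); // caught: name must be a string
});
```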

### Promisifying

- [Promisifying functions and modules that use nodebacks](http://bluebirdjs.com/docs/api/promisification.html) (Node.js callbacks)
- [An example of manually promisifying an EventEmitter](https://gist.github.com/joepie91/3610c6e41bc654ccaadf)
- [Promisifying `fs.exists`](https://gist.github.com/joepie91/bbf495e044da043de2ba) (which is async, but doesn't follow the nodeback convention)

### Functional (map, filter, reduce)

- [Functional programming in Javascript: map, filter and reduce](http://cryto.net/~joepie91/blog/2015/05/04/functional-programming-in-javascript-map-filter-reduce/) (an introduction, not Bluebird-specific, but important to understand)
- [(Synchronous) examples of map, filter, and reduce in Bluebird](https://gist.github.com/joepie91/34742045a40f7c48430e)
- [Example of using map for retrieving a (remote) list of URLs with bhttp](https://gist.github.com/joepie91/4c125c45ee6c5ea0375f)

### Nesting

- [Example of retaining scope through nesting](https://gist.github.com/joepie91/7d22af310ef68de4f507)
- [Example of 'breaking out' of a chain through nesting](https://gist.github.com/joepie91/c5f99a18975df0bf2f98)
- [Example of a nested Promise.map](https://gist.github.com/joepie91/2aafe9e4830e0d0c8171)
- An example with increasing complexity, implementing an 'error-tolerant' Promise.map: [part 1](https://gist.github.com/joepie91/045a0238d0751cc7a72b), [part 2](https://gist.github.com/joepie91/11e36819dcca49f54348), [part 3](https://gist.github.com/joepie91/9593551b41f568a75b08)

### ES6 Promises

- [Documentation on MDN](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Promise)
- [Promise.try using ES6 Promises](https://gist.github.com/joepie91/255250eeea8b94572a03)
- [Promise.delay using ES6 Promises](https://gist.github.com/joepie91/583db45f3a30552a7cd2)

### Odds and ends

Some potentially useful snippets:

- [Flattening an array of arrays, when using promises](https://gist.github.com/joepie91/ac1ee270c6a506405d5f)

You're unlikely to need any of these things, if you just stick with either Bluebird or ES6 promises:

- [How to test whether a Promises implementation handles callbacks correctly](https://gist.github.com/joepie91/48042173a6c9c4065399)
- [Why this matters.](https://gist.github.com/joepie91/98576de0fab7badec167)

# The Promises FAQ - addressing the most common questions and misconceptions about Promises

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/4c3a10629a4263a522e3bc4839a28c83](https://gist.github.com/joepie91/4c3a10629a4263a522e3bc4839a28c83). Nowadays Promises are more widely understood and supported, and it's not as relevant as it once was, but it's kept here for posterity.</p>

<p class="callout success">By the way, I'm available for [tutoring and code review](http://cryto.net/~joepie91/code-review.html) :)</p>


### 1. What Promises library should I use?

That depends a bit on your use case.

My usual recommendation is Bluebird - it's robust, has good error handling and debugging facilities, is fast, and has a well-designed API. The downside is that Bluebird will not correctly work in older browsers (think Internet Explorer 8 and older), and when used in Browserified/Webpacked code, it can sometimes add a lot to your bundle size.

ES6 Promises are gaining a lot of traction purely because of being "ES6", but in practice they are just *not very good*. They are generally lacking standardized debugging facilities, they are missing essential utilities such as Promise.try/promisify/promisifyAll, they cannot catch specific error types (this is a big robustness issue), and so on.

ES6 Promises can be useful in constrained scenarios (eg. older browsers with a polyfill, restricted non-V8 runtimes, etc.) but I would not generally recommend them.

There are many other Promise implementations (Q, WhenJS, etc.) - but frankly, I've not seen any that are an improvement over either Bluebird or ES6 Promises in their respective 'optimal scenarios'. I'd also recommend explicitly *against* Q because it is extremely slow and has a very poorly designed API.

**In summary:** Use Bluebird, unless you have a very specific reason not to. In those very specific cases, you probably want ES6 Promises.

### 2. How do I create a Promise myself?

Usually, you don't. Promises are not usually something you 'create' explicitly - rather, they're a *natural consequence* of chaining together multiple operations. Take this example:

```javascript
function getLinesFromSomething() {
    return Promise.try(() => {
        return bhttp.get("http://example.com/something.txt");
    }).then((response) => {
        return response.body.toString().split("\n");
    });
}
```

In this example, all of the following *technically* result in a new Promise:

- `Promise.try(...)`
- `bhttp.get(...)`
- The synchronous value from the `.then` callback, which gets converted automatically to a resolved Promise (see [question 5](https://gist.github.com/joepie91/4c3a10629a4263a522e3bc4839a28c83#5-but-what-if-i-want-to-resolve-a-synchronous-result-or-error))

... but none of them are explicitly created as "a new Promise" - that's just the natural consequence of starting a chain with `Promise.try` and then returning Promises or values from the callbacks.

There is one exception to this, where you *do* need to explicitly create a new Promise - when converting a different kind of asynchronous API to a Promises API, and even then you only need to do this if `promisify` and friends don't work. This is explained in [question 7](https://gist.github.com/joepie91/4c3a10629a4263a522e3bc4839a28c83#7-how-do-i-make-this-non-promises-library-work-with-promises).

### 3. How do I use `new Promise`?

You don't, usually. In almost every case, you either need [Promise.try](http://cryto.net/~joepie91/blog/2016/05/11/what-is-promise-try-and-why-does-it-matter/), or some kind of promisification method. [Question 7](https://gist.github.com/joepie91/4c3a10629a4263a522e3bc4839a28c83#7-how-do-i-make-this-non-promises-library-work-with-promises) explains how you should do promisification, and when you *do* need `new Promise`.

But when in doubt, don't use it. It's very error-prone.

### 4. How do I resolve a Promise?

You don't, usually. Promises are not something you need to 'resolve' manually - rather, you should just *return* some kind of Promise, and let the Promise library handle the rest.

There's one exception here: when you're manually promisifying a strange API using `new Promise`, you need to call `resolve()` or `reject()` for a successful and unsuccessful state, respectively. Make sure to read question 3, though - you should almost never actually *use* `new Promise`.
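As an illustration of that exception, here's a sketch of wrapping a made-up API using `new Promise` - `oddParser`, with its separate success and error callbacks, is entirely hypothetical:

```javascript
// `oddParser` is a hypothetical API with separate success/error callbacks:
const oddParser = {
	parse: (input, onSuccess, onError) => {
		if (typeof input === "string") {
			onSuccess(input.toUpperCase());
		} else {
			onError(new Error("input must be a string"));
		}
	}
};

// The wrapper does *nothing* except convert the API: success calls
// `resolve`, failure calls `reject`. All further processing happens
// outside of `new Promise`.
function parseAsync(input) {
	return new Promise((resolve, reject) => {
		oddParser.parse(input, resolve, reject);
	});
}

parseAsync("hello").then((result) => {
	console.log(result); // HELLO
});
```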

### 5. But what if I want to resolve a synchronous result or error?

You simply `return` it (if it's a result) or `throw` it (if it's an error), from your `.then` callback. When using Promises, synchronously returned values are automatically converted into a *resolved Promise*, whereas synchronously thrown errors are automatically converted into a *rejected Promise*. You don't need to use `Promise.resolve()` or `Promise.reject()`.
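For example, sketched with ES6 Promises (`somePromise` is just a placeholder for whatever chain you're already in):

```javascript
// `somePromise` stands in for whatever Promise chain you are already in:
let somePromise = Promise.resolve("42");

let result = somePromise.then((text) => {
	let parsed = parseInt(text, 10);

	if (Number.isNaN(parsed)) {
		// A synchronous `throw` becomes a rejected Promise:
		throw new Error("not a number");
	} else {
		// A synchronous `return` becomes a resolved Promise:
		return parsed;
	}
});

result.then((value) => {
	console.log(value); // 42
});
```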

### 6. But what if it's at the start of a chain, and I'm not in a `.then` callback yet?

[Using Promise.try](http://cryto.net/~joepie91/blog/2016/05/11/what-is-promise-try-and-why-does-it-matter/) makes this problem go away entirely.

### 7. How do I make this non-Promises library work with Promises?

That depends on what kind of API it is.

- **Node.js-style error-first callbacks:** Use [Promise.promisify and/or Promise.promisifyAll](http://bluebirdjs.com/docs/api/promisification.html) to convert the library to a Promises API. For ES6 Promises, use the [es6-promisify](https://www.npmjs.com/package/es6-promisify) and [es6-promisify-all](https://www.npmjs.com/package/es6-promisify-all) libraries respectively. In Node.js, `util.promisify` can also be used.
- **EventEmitters:** It depends. Promises are explicitly meant to represent an operation that succeeds or fails *precisely once*, so *most* EventEmitters cannot be converted to a Promise, as they will have *multiple* results. Some exceptions exist; for example, the `response` event when making a HTTP request - in these cases, use something like [bluebird-events](https://www.npmjs.com/package/bluebird-events).
- **setTimeout:** Use [`Promise.delay`](http://bluebirdjs.com/docs/api/promise.delay.html) instead, which comes with Bluebird.
- **setInterval:** Avoid `setInterval` entirely ([this is why](https://zetafleet.com/blog/2010/04/why-i-consider-setinterval-to-be-harmful.html)), and use a recursive `Promise.delay` instead.
- **Asynchronous callbacks with a single result argument, and no `err`:** Use [promisify-simple-callback](https://www.npmjs.com/package/promisify-simple-callback).
- **A different Promises library:** No manual conversion is necessary, as long as it is compliant with the Promises/A+ specification (and nearly every implementation is). Make sure to use [Promise.try](http://cryto.net/~joepie91/blog/2016/05/11/what-is-promise-try-and-why-does-it-matter/) in your code, though.
- **Synchronous functions:** No manual conversion is necessary. Synchronous returns and throws are automatically converted by your Promises library. Make sure to use [Promise.try](http://cryto.net/~joepie91/blog/2016/05/11/what-is-promise-try-and-why-does-it-matter/) in your code, though.
- **Something else not listed here:** You'll probably have to promisify it manually, using `new Promise`. Make sure to keep the code within `new Promise` as minimal as possible - you should have a function that *only* promisifies the API you intend to use, without doing *anything* else. All further processing should happen outside of `new Promise`, once you already have a Promise object.
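The `setInterval` replacement from the list above can be sketched as a recursive chain - `delay` is a minimal ES6 stand-in for Bluebird's `Promise.delay`, and `task` is whatever work you want to repeat:

```javascript
// Minimal ES6 equivalent of Bluebird's Promise.delay:
const delay = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

function poll(task, interval) {
	// Unlike setInterval, the next iteration is only scheduled once the
	// previous one has *completed* - so iterations can never overlap, and
	// errors propagate out of the recursion like in any other chain.
	return Promise.resolve().then(() => {
		return task();
	}).then(() => {
		return delay(interval);
	}).then(() => {
		return poll(task, interval);
	});
}
```

Note that `poll` runs forever by design; to stop it, throw from `task` (or add a condition before the recursive call).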

### 8. How do I propagate errors, like with `if(err) return cb(err)`?

You don't. Promises will propagate errors automatically, and you don't need to do anything special for it - this is one of the benefits that Promises provide over error-first callbacks.

When using Promises, the *only* case where you need to `.catch` an error, is if you intend to handle it - and you should always *only* catch the types of error you're interested in.

These two Gists ([step 1](https://gist.github.com/joepie91/2b62b735020e51b260abacaa133f48f0), [step 2](https://gist.github.com/joepie91/b0c8f9a9309f5398080eab84482d58a4)) show how error propagation works, and how to `.catch` specific types of errors.

### 9. How do I break out of a Promise chain early?

You don't. You [use conditionals instead](https://gist.github.com/joepie91/c5f99a18975df0bf2f98). Of course, specifically for *failure scenarios*, you'd still throw an error.
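A sketch of the conditional approach - the cache and 'network' sources here are hypothetical stand-ins, included so the example is self-contained:

```javascript
// Hypothetical stand-ins for a cache and a slower 'real' source:
const cache = new Map([["alice", "cached profile for alice"]]);
const fetchFromCache = (id) => Promise.resolve(cache.get(id));
const fetchFromNetwork = (id) => Promise.resolve("network profile for " + id);

function getProfile(id) {
	return Promise.resolve().then(() => {
		return fetchFromCache(id);
	}).then((cached) => {
		if (cached !== undefined) {
			return cached; // cache hit: the slow lookup is skipped
		} else {
			return fetchFromNetwork(id); // cache miss: do the slow lookup
		}
	});
}

getProfile("alice").then((profile) => {
	console.log(profile); // cached profile for alice
});
```

Rather than 'aborting' the chain, the conditional simply decides which value (or which further Promise) the rest of the chain continues with.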

### 10. How do I convert a Promise to a synchronous value?

You can't. Once you write asynchronous code, all of the 'surrounding' code *also* needs to be asynchronous. However, you can just have a Promise chain in the 'parent code', and return the Promise from your own method.

For example:

```javascript
function getUserFromDatabase(userId) {
    return Promise.try(() => {
        return database.table("users").where({id: userId}).get();
    }).then((results) => {
        if (results.length === 0) {
            throw new MyCustomError("No users found with that ID");
        } else {
            return results[0];
        }
    });
}

/* Now, to *use* that getUserFromDatabase function, we need to have another Promise chain: */

Promise.try(() => {
    // Here, we return the result of calling our own function. That return value is a Promise.
    return getUserFromDatabase(42);
}).then((user) => {
    console.log("The username of user 42 is:", user.username);
});
```

(If you're not sure what Promise.try is or does, [this article](http://cryto.net/~joepie91/blog/2016/05/11/what-is-promise-try-and-why-does-it-matter/) will explain it.)

### 11. How do I save a value from a Promise outside of the callback?

You don't. See [question 10](https://gist.github.com/joepie91/4c3a10629a4263a522e3bc4839a28c83#10-how-do-i-convert-a-promise-to-a-synchronous-value) above - you need to use Promises "all the way down".

### 12. How do I access previous results from the Promise chain?

In some cases, you might need to access an *earlier* result from a chain of Promises, one that you don't have access to anymore. A simple example of this scenario:

```javascript
'use strict';

// ...

Promise.try(() => {
    return database.query("users", {id: req.body.userId});
}).then((user) => {
    return database.query("groups", {id: req.body.groupId});
}).then((group) => {
    res.json({
        user: user, // This is not possible, because `user` is not in scope anymore.
        group: group
    });
});
```

This is a fairly simple case - the `user` query and the `group` query are completely independent, and they can be run at the same time. Because of that, we can use `Promise.all` to run them in parallel, and return a combined Promise for *both* of their results:

```javascript
'use strict';

// ...

Promise.try(() => {
    return Promise.all([
        database.query("users", {id: req.body.userId}),
        database.query("groups", {id: req.body.groupId})
    ]);
}).spread((user, group) => {
    res.json({
        user: user, // Now it's possible!
        group: group
    });
});
```

Note that instead of `.then`, we use `.spread` here. Promises only support a *single* result argument for a `.then`, which is why a Promise created by `Promise.all` would resolve to an array of `[user, group]` in this case. However, `.spread` is a Bluebird-specific variation of `.then` that will automatically "unpack" that array into multiple callback arguments. Alternatively, you can use [ES6 array destructuring](https://developer.mozilla.org/en/docs/Web/JavaScript/Reference/Operators/Destructuring_assignment) to accomplish the same.
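The destructuring variant works with plain ES6 Promises, no `.spread` needed. A self-contained sketch, where the two `Promise.resolve` calls stand in for the database queries above:

```javascript
// Stand-ins for the `users` and `groups` queries from the example:
const getUser = () => Promise.resolve({ name: "alice" });
const getGroup = () => Promise.resolve({ name: "admins" });

function getUserAndGroup() {
	return Promise.all([
		getUser(),
		getGroup()
	]).then(([user, group]) => {
		// `[user, group]` destructures the array that Promise.all resolves to
		return { user: user, group: group };
	});
}

getUserAndGroup().then((result) => {
	console.log(result.user.name, result.group.name); // alice admins
});
```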

Now, the above example assumes that the two asynchronous operations are *independent* - that is, they can run in parallel without caring about the result of the other operation. In some cases, you will want to use the results of two operations that are *dependent* - while you still want to use the results of both at the same time, the second operation also needs the result of the first operation to work.

An example:

```javascript
'use strict';

// ...

Promise.try(() => {
    return getDatabaseConnection();
}).then((databaseConnection) => {
    return databaseConnection.query("users", {id: req.body.id});
}).then((user) => {
    res.json(user);

    // This is not possible, because we don't have `databaseConnection` in scope anymore:
    databaseConnection.close();
});
```

In these cases, rather than using `Promise.all`, you'd *add a level of nesting* to keep something in scope:

```javascript
'use strict';

// ...

Promise.try(() => {
    return getDatabaseConnection();
}).then((databaseConnection) => {
    // We nest here, so that `databaseConnection` remains in scope.

    return Promise.try(() => {
        return databaseConnection.query("users", {id: req.body.id});
    }).then((user) => {
        res.json(user);

        databaseConnection.close(); // Now it works!
    });
});
```

Of course, as with any kind of nesting, you should do it sparingly - and only when necessary for a situation like this. Splitting up your code into small functions, with each of them having a *single* responsibility, will prevent trouble with this.

# Error handling (with Promises)

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/c8d8cc4e6c2b57889446](https://gist.github.com/joepie91/c8d8cc4e6c2b57889446). It only applies when using Promise chaining syntax; when you use `async`/`await`, you are instead expected to use `try`/`catch`, which unfortunately does not support error filtering.</p>

<div class="Box-body readme blob p-5 p-xl-6 gist-border-0" id="bkmrk-there%27s-roughly-thre"><article class="markdown-body entry-content container-lg">There are roughly three types of errors:

1. **Expected errors** - eg. "URL is unreachable" for a link validity checker. You should handle these in your code at the top-most level where it is practical to do so.
2. **Unexpected errors** - eg. a bug in your code. These should crash your process (yes, really), they should be logged and ideally e-mailed to you, and you should fix them right away. You should never catch them for any purpose other than to log the error, and even then you should make the process crash.
3. **User-facing errors** - not really in the same category as the above two. While you can represent them with error objects (and it's often practical to do so), they're not really errors in the programming sense - rather, they're user feedback. When represented as error objects, these should only ever be handled at the top-most point of a request - in the case of Express, that would be the error-handling middleware that sends a HTTP status code and a response.

### Would I still need to use try/catch if I use promises?

*Sort of.* Not the usual `try`/`catch`, but eg. Bluebird has a `.try` and `.catch` equivalent. It works like synchronous `try`/`catch`, though - errors are propagated upwards automatically so that you can handle them where appropriate.

Bluebird's `try` isn't identical to a standard JS `try` - it's more a 'start using Promises' thing, so that you can also wrap synchronous errors. That's the magic of Promises, really - they let you handle synchronous and asynchronous errors/values like they're one and the same thing.

Below is a relatively complex example that uses a custom 'error filter' (predicate), because filesystem errors have an error *code*, but not a special error type. The error filtering is only available in Bluebird, by the way - 'native' Promises don't have it.

```javascript
/* UPDATED: This example has been changed to use the new object predicates, that were
 * introduced in Bluebird 3.0. If you are using Bluebird 2.x, you will need to use the
 * older example below, with the predicate function. */

var Promise = require("bluebird");
var fs = Promise.promisifyAll(require("fs"));

Promise.try(function(){
	return fs.readFileAsync("./config.json").then(JSON.parse);
}).catch({code: "ENOENT"}, function(err){
	/* Return an empty object. */
	return {};
}).then(function(config){
	/* `config` now either contains the JSON-parsed configuration file, or an empty object if no configuration file existed. */
});
```

If you are still using Bluebird 2.x, you should use predicate functions instead:

```javascript
/* This example is ONLY for Bluebird 2.x. When using Bluebird 3.0 or newer, you should
 * use the updated example above instead. */

var Promise = require("bluebird");
var fs = Promise.promisifyAll(require("fs"));

var NonExistentFilePredicate = function(err) {
	return (err.code === "ENOENT");
};

Promise.try(function(){
	return fs.readFileAsync("./config.json").then(JSON.parse);
}).catch(NonExistentFilePredicate, function(err){
	/* Return an empty object. */
	return {};
}).then(function(config){
	/* `config` now either contains the JSON-parsed configuration file, or an empty object if no configuration file existed. */
});
```

</article></div>

# Bluebird Promise.try using ES6 Promises

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/255250eeea8b94572a03](https://gist.github.com/joepie91/255250eeea8b94572a03).</p>

<div class="Box-body readme blob p-5 p-xl-6 gist-border-0" id="bkmrk-note-that-this-will-"><article class="markdown-body entry-content container-lg">Note that this will only be equivalent to `Promise.try` if your runtime or ES6 Promise shim correctly catches synchronous errors in Promise constructors.

If you are using the latest version of Node, this should be fine.

```javascript
var Promise = require("es6-promise").Promise;

module.exports = function promiseTry(func) {
    return new Promise(function(resolve, reject) {
        resolve(func());
    });
};
```
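A quick usage sketch: because the synchronous `throw` happens inside the Promise constructor, it surfaces as a rejection rather than escaping synchronously. (This example inlines the function with a native `Promise` for illustration.)

```javascript
function promiseTry(func) {
    return new Promise(function(resolve, reject) {
        resolve(func());
    });
}

promiseTry(function() {
    throw new Error("boom");
}).catch(function(err) {
    /* The synchronous throw arrives here as a rejection. */
    console.log("caught:", err.message);
});
```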

</article></div>

# Please don't include minified builds in your npm packages!

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/04cc8329df231ea3e262dffe3d41f848](https://gist.github.com/joepie91/04cc8329df231ea3e262dffe3d41f848). </p>

There's quite a few libraries on npm that not only include the regular build in their package, but also a minified build. While this may seem like a helpful addition to make the package more complete, it actually poses a real problem: it becomes very difficult to audit these libraries.

### The problem

You've probably seen incidents like the [`event-stream` incident](https://blog.npmjs.org/post/180565383195/details-about-the-event-stream-incident), where a library was compromised in some way by an attacker. This sort of thing, also known as a "supply-chain attack", is starting to become more and more common - and it's something that developers need to protect themselves against.

One effective way to do so is by auditing dependencies: having at least a cursory look through every dependency in your dependency tree, to ensure that there's nothing sketchy in there. While it isn't going to be 100% perfect, it will detect most of these attacks - and not only is briefly reviewing dependencies *still* faster than reinventing your own wheels, it'll also give you more insight into how your application actually works under the hood.

But, there's a problem: a lot of packages include almost-duplicate builds, sometimes even minified ones. It's becoming increasingly common to see a separate CommonJS and ESM build, but in many cases there's a *minified* build included too. And those are basically impossible to audit! Even with a code beautifier, it's very difficult to understand what's really going on. But you can't ignore them either, because if they are a part of the package, then other code can require them. So you *have* to audit them.

There's a workaround for this, in the form of "reproducing" the build; taking the original (Git) repository for the package which only contains the original code and not the minified code, checking out the intended version, and then just running a build that *creates* the minified version, which you can then compare to the one on npm. If they match, then you can assume that you only need to audit the original source in the Git repo.

Or well, that *would* be the case, if it weren't possible for the *build tools* to introduce malicious code as well. Argh! Now you need to audit *all of the build tools being used* as well, at the specific versions that are being used by each dependency. Basically, you're now auditing hundreds of build stacks. This is a massive waste of time for every developer who wants to make sure there's nothing sketchy in their dependencies!

All the while these minified builds don't really solve a problem. Which brings me to...

### Why it's unnecessary to include minified builds

As a library author, you are going to be dealing with roughly two developer demographics:

1. Those who just want a file they can include as a `<script>` tag, so that they can use your library in their (often legacy) module-less code.
2. Those with a more modern development stack, including a package manager (npm) and often also build tooling.

For the first demographic, it makes a lot of sense to provide a pre-minified build, as they are going to directly include it in their site, and it should ideally be small. But, here's the rub: those are *also* the developers who probably aren't using (or don't want to use) a package manager like npm! There's not really a reason why their pre-minified build should exist *on npm*, specifically - you might just as well offer it as a separate download.

For the second demographic, a pre-minified build isn't really useful at all. They probably already have their own development stack that does minification (of their own code *and* dependencies), and so they simply won't be using your minified build.

In short: there's not really a point to having a minified build *in your npm package*.

### The solution

Simply put: don't include minified files in your npm package - distribute them separately instead. In most cases, you can just put them on your project's website, or even in the (Git) repository.

If you really do have some specific reason to need to distribute them through npm, at least put them in a *separate package* (eg. `yourpackage-minified`), so that only those who actually *use* the minified version need to add it to their dependency folder.

Ideally, try to only have a single copy of your code in your package at all - so also no separate CommonJS and ESM builds, for example. CommonJS works basically everywhere, and there's [basically no reason to use ESM anyway](https://wiki.slightly.tech/books/miscellaneous-notes/page/es-modules-are-terrible-actually "ES Modules are terrible, actually"), so this should be fine for most projects.

If you really *must* include an ESM version of your code, you should at least [use a wrapping approach](https://nodejs.org/api/esm.html#esm_approach_1_use_an_es_module_wrapper) instead of duplicating the code (note that this can be a breaking change!). But if you can, please leave it out to make it easier for developers to understand what they are installing into their project!

Anyone should be able to audit and review their dependencies, not just large companies with deep pockets; and not including unnecessarily duplicated or obfuscated code in your packages will go a long way towards that. Thanks!

# How to get the actual width of an element in jQuery, even with box-sizing: border-box

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/5ffffefbf24dcfdb4477](https://gist.github.com/joepie91/5ffffefbf24dcfdb4477).</p>

<div class="Box-body readme blob p-5 p-xl-6 gist-border-0" id="bkmrk-this-is-ridiculous%2C-"><article class="markdown-body entry-content container-lg">This is ridiculous, but per [the jQuery documentation](https://api.jquery.com/width/):

> Note that `.width()` will always return the content width, regardless of the value of the CSS `box-sizing` property. As of jQuery 1.8, this may require retrieving the CSS width plus `box-sizing` property and then subtracting any potential border and padding on each element when the element has `box-sizing: border-box`. To avoid this penalty, use `.css( "width" )` rather than `.width()`.

```javascript
function parsePx(input) {
	let match;
	
	if (match = /^([0-9.]+)px$/.exec(input)) {
		return parseFloat(match[1]);
	} else {
		throw new Error("Value is not in pixels!");
	}
}

$.prototype.actualWidth = function() {
	/* WTF, jQuery? */
	let isBorderBox = (this.css("box-sizing") === "border-box");
	let width = this.width();
	
	if (isBorderBox) {
		width = width
			+ parsePx(this.css("padding-left"))
			+ parsePx(this.css("padding-right"))
			+ parsePx(this.css("border-left-width"))
			+ parsePx(this.css("border-right-width"));
	}
	
	return width;
};
```

</article></div>

# A survey of unhandledRejection and rejectionHandled handlers

<div class="Box-body readme blob p-5 p-xl-6 gist-border-0" id="bkmrk-this-article-was-ori"><article class="markdown-body entry-content container-lg"><p class="callout info">This article was originally published at [https://gist.github.com/joepie91/06cca7058a34398f168b08223b642162](https://gist.github.com/joepie91/06cca7058a34398f168b08223b642162). </p>

Bluebird ([http://bluebirdjs.com/docs/api/error-management-configuration.html#global-rejection-events](http://bluebirdjs.com/docs/api/error-management-configuration.html#global-rejection-events))

- `process.on//unhandledRejection`: **(Node.js)** Potentially unhandled rejection.
- `process.on//rejectionHandled`: **(Node.js)** Cancel unhandled rejection, it was handled anyway.
- `self.addEventListener//unhandledrejection`: **(WebWorkers)** Potentially unhandled rejection.
- `self.addEventListener//rejectionhandled`: **(WebWorkers)** Cancel unhandled rejection, it was handled anyway.
- `window.addEventListener//unhandledrejection`: **(Modern browsers, IE &gt;= 9)** Potentially unhandled rejection.
- `window.addEventListener//rejectionhandled`: **(Modern browsers, IE &gt;= 9)** Cancel unhandled rejection, it was handled anyway.
- `window.onunhandledrejection`: **(IE &gt;= 6)** Potentially unhandled rejection.
- `window.onrejectionhandled`: **(IE &gt;= 6)** Cancel unhandled rejection, it was handled anyway.

WhenJS ([https://github.com/cujojs/when/blob/3.7.0/docs/debug-api.md](https://github.com/cujojs/when/blob/3.7.0/docs/debug-api.md))

- `process.on//unhandledRejection`: **(Node.js)** Potentially unhandled rejection.
- `process.on//rejectionHandled`: **(Node.js)** Cancel unhandled rejection, it was handled anyway.
- `window.addEventListener//unhandledRejection`: **(Modern browsers, IE &gt;= 9)** Potentially unhandled rejection.
- `window.addEventListener//rejectionHandled`: **(Modern browsers, IE &gt;= 9)** Cancel unhandled rejection, it was handled anyway.

Spec ([https://gist.github.com/benjamingr/0237932cee84712951a2](https://gist.github.com/benjamingr/0237932cee84712951a2))

- `process.on//unhandledRejection`: **(Node.js)** Potentially unhandled rejection.
- `process.on//rejectionHandled`: **(Node.js)** Cancel unhandled rejection, it was handled anyway.

Spec (WHATWG: [https://html.spec.whatwg.org/multipage/webappapis.html#unhandled-promise-rejections](https://html.spec.whatwg.org/multipage/webappapis.html#unhandled-promise-rejections))

- `window.addEventListener//unhandledrejection`: **(Browsers)** Potentially unhandled rejection.
- `window.addEventListener//rejectionhandled`: **(Browsers)** Cancel unhandled rejection, it was handled anyway.
- `window.onunhandledrejection`: **(Browsers)** Potentially unhandled rejection.
- `window.onrejectionhandled`: **(Browsers)** Cancel unhandled rejection, it was handled anyway.

ES6 Promises in Node.js ([https://nodejs.org/api/process.html#process\_event\_rejectionhandled](https://nodejs.org/api/process.html#process_event_rejectionhandled) onwards)

- `process.on//unhandledRejection`: Potentially unhandled rejection.
- `process.on//rejectionHandled`: Cancel unhandled rejection, it was handled anyway.

Yaku ([https://github.com/ysmood/yaku#unhandled-rejection](https://github.com/ysmood/yaku#unhandled-rejection))

- `process.on//unhandledRejection`: **(Node.js)** Potentially unhandled rejection.
- `process.on//rejectionHandled`: **(Node.js)** Cancel unhandled rejection, it was handled anyway.
- `window.onunhandledrejection`: **(Browsers)** Potentially unhandled rejection.
- `window.onrejectionhandled`: **(Browsers)** Cancel unhandled rejection, it was handled anyway.
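As a concrete illustration of the Node.js pair that appears in every list above, wiring both handlers up might look like this (a sketch, not part of the original survey):

```javascript
/* Track rejections that have not been handled yet; remove them again
 * if they turn out to be handled later after all. */
const pendingRejections = new Map();

process.on("unhandledRejection", function(reason, promise) {
	pendingRejections.set(promise, reason);
});

process.on("rejectionHandled", function(promise) {
	pendingRejections.delete(promise);
});
```

This is the "potentially unhandled" / "cancel, it was handled anyway" dance: a rejection may look unhandled at first, and only later gain a `.catch` handler, at which point the second event fires.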

</article></div>

# Quill.js glossary

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/46241ef1ce89c74958da0fdd7d04eb55](https://gist.github.com/joepie91/46241ef1ce89c74958da0fdd7d04eb55).</p>

<div class="Box-body readme blob p-5 p-xl-6 gist-border-0" id="bkmrk-since-quill.js-doesn"><article class="markdown-body entry-content container-lg">Since Quill.js doesn't seem to document its strange jargon-y terms anywhere, here's a glossary that I've put together for it. No guarantees that it's correct! But I've done my best.

**Quill** - The WYSIWYG editor library

**Parchment** - The internal model used in Quill to implement the document tree

**Scroll** - A document, expressed as a tree, technically also a Blot (node) itself, specifically the root node

**Blot** - A node in the document tree

**Block (Blot)** - A block-level node

**Inline (Blot)** - An inline (formatting) node

**Text (Blot)** - A node that contains only(!) raw text contents

**Break (Blot)** - A node that contains nothing, used as a placeholder where there is no actual content

**"a format"** - A specific formatting attribute (width, height, is bold, ...)

**`.format(...)`** - The API method that is used to set a formatting attribute on some selection

</article></div>

# Riot.js cheatsheet

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/ed3a267de70210b46fb06dd57077827a](https://gist.github.com/joepie91/ed3a267de70210b46fb06dd57077827a). </p>

### Component styling

<p class="callout warning">**This section only applies to Riot.js 2.x.** Since 3.x, all styles are scoped *by default* and you can simply add a `style` tag to your component.</p>

1. You can use a `<style>` tag within your tag. This style tag is **applied globally** by default.
2. You can **scope your style tag** to limit its effect to the component that you've defined it in. Note that scoping is **based on the tag name**. There are two options:
    - Use the `scoped` attribute, eg. `<style scoped> ... </style>`
    - Use the `:scope` pseudo-selector, eg. `<style> :scope { ... } </style>`
3. You can change where global styles are 'injected' by having `<style type="riot"></style>` somewhere in your `<head>`. This is useful for eg. controlling what styles are overridden.

### Mounting

"Mounting" is the act of attaching a custom tag's template and behaviour to a specific element in the DOM. The most common case is to mount all instances of a specific top-level tag, but there are more options:

1. Mount all custom tags on the page: `riot.mount("*")`
2. Mount all instances of a specific tag name: `riot.mount("app")`
3. Mount a tag with a specific ID: `riot.mount("#specific_element")`
4. Mount using a more complex selector: `riot.mount("foo, bar")`

Note that "child tags" (that is, custom tags that are specified within other custom tags) are automatically mounted as-needed. You do not need to `riot.mount` them separately.

The simplest example:

```html
<script>
// Load the `app` tag's definition here somehow...

document.addEventListener("DOMContentLoaded", (event) => {
    riot.mount("app");
});
</script>

<app></app>
```

### Tag logic

- **Conditionally add to DOM:** `<your-tag if="{ something === true }"> ... </your-tag>`
- **Conditionally display:** `<your-tag show="{ something === true }"> ... </your-tag>` (but the tag always *exists* in the DOM)
- **Conditionally hide:** `<your-tag hide="{ something === true }"> ... </your-tag>` (but the tag always *exists* in the DOM)
- **For-each loop:** `<your-tag for="{ item in items }"> ... (you can access 'item' from within the tag) ... </your-tag>` (one instance of `your-tag` for each `item` in `items`)
- **For-each loop of an object:** `<your-tag for="{ key, value in someObject }"> ... (you can access 'key' and 'value' from within the tag) ... </your-tag>` (this is *slow!*)

All of the above also work on *regular* (ie. non-Riot) HTML tags.

If you need to add/hide/display/loop a *group* of tags, rather than a single one, you can wrap them in a `<virtual>` pseudo-tag. This works with all of the above constructs. For example:

```html
<virtual for="{item in items}">
    <label>{item.label}</label>
    <textarea>{item.defaultValue}</textarea>
</virtual>
```

# Quick reference for `checkit` validators

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/cd107b3a566264b28a3494689d73e589](https://gist.github.com/joepie91/cd107b3a566264b28a3494689d73e589).</p>

### Presence

- **exists -** The field must exist, and not be `undefined`.
- **required -** The field must exist, and not be `undefined`, `null` or an empty string.
- **empty -** The field must be some kind of "empty". Things that are considered "empty" are as follows: 
    - `""` (empty string)
    - `[]` (empty array)
    - `{}` (empty object)
    - Other falsey values

### Character set

- **alpha -** `a-z`, `A-Z`
- **alphaNumeric -** `a-z`, `A-Z`, `0-9`
- **alphaUnderscore -** `a-z`, `A-Z`, `0-9`, `_`
- **alphaDash -** `a-z`, `A-Z`, `0-9`, `_`, `-`

### Value

Length-related validators may apply to both strings and arrays.

- **exactLength:`length` -** The value must have a length of exactly `length`.
- **minLength:`length` -** The value must have a length of at least `length`.
- **maxLength:`length` -** The value must have a length of at most `length`.
- **contains:`needle` -** The value must contain the specified `needle` (applies to both strings and arrays).
- **accepted -** Must be a value that indicates agreement - varies by language (defaulting to `en`): 
    - **en, fr, nl -** `"yes"`, `"on"`, `"1"`, `1`, `"true"`, `true`
    - **es -** `"yes"`, `"on"`, `"1"`, `1`, `"true"`, `true`, `"si"`
    - **ru -** `"yes"`, `"on"`, `"1"`, `1`, `"true"`, `true`, `"да"`

### Value (numbers)

Note that "numbers" refers to both Number-type values, and strings containing numeric values!

- **numeric -** Must be a finite numeric value of some sort.
- **integer -** Must be an integer value (either positive or negative).
- **natural -** Must be a natural number (ie. an integer value of 0 or higher).
- **naturalNonZero -** Must be a natural number, but *higher* than 0 (ie. an integer value of 1 or higher).
- **between:`min`:`max` -** The value must numerically be between the `min` and `max` values (exclusive).
- **range:`min`:`max` -** The value must numerically be *within* the `min` and `max` values (inclusive).
- **lessThan:`maxValue` -** The value must numerically be less than the specified `maxValue` (exclusive).
- **lessThanEqualTo:`maxValue` -** The value must numerically be less than *or equal to* the specified `maxValue` (inclusive).
- **greaterThan:`minValue` -** The value must numerically be greater than the specified `minValue` (exclusive).
- **greaterThanEqualTo:`minValue` -** The value must numerically be greater than *or equal to* the specified `minValue` (inclusive).
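The `rule:argument` notation used above decomposes in a predictable way. As a hypothetical sketch (for illustration only - this is not checkit's actual implementation), a rule string splits into a validator name plus its arguments:

```javascript
/* Hypothetical helper: splits a rule string like "between:5:10"
 * into its validator name and argument list. */
function parseRule(rule) {
	const segments = rule.split(":");
	return { name: segments[0], args: segments.slice(1) };
}

parseRule("between:5:10"); /* → { name: "between", args: ["5", "10"] } */
parseRule("required");     /* → { name: "required", args: [] } */
```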

### Relations to other fields

- **matchesField:`field` -** The value in this field must equal the value in the specified other `field`.
- **different:`field` -** The value in this field must *not* equal the value in the specified other `field`.

### JavaScript types

- **NaN -** Must be `NaN`.
- **null -** Must be `null`.
- **string -** Must be a `String`.
- **number -** Must be a `Number`.
- **array -** Must be an `Array`.
- **plainObject -** Must be a plain `object` (ie. object literal).
- **date -** Must be a `Date` object.
- **function -** Must be a `Function`.
- **regExp -** Must be a `RegExp` object.
- **arguments -** Must be an `arguments` object.

### Format

- **email -** Must be a validly formatted e-mail address.
- **luhn -** Must be a validly formatted credit card number (according to a Luhn regular expression).
- **url -** Must be a validly formatted URL.
- **ipv4 -** Must be a validly formatted IPv4 address.
- **ipv6 -** Must be a validly formatted IPv6 address.
- **uuid -** Must be a validly formatted UUID.
- **base64 -** Must be a validly formatted base64 string.

# ES Modules are terrible, actually

<p class="callout info">This post was originally published at [https://gist.github.com/joepie91/bca2fda868c1e8b2c2caf76af7dfcad3](https://gist.github.com/joepie91/bca2fda868c1e8b2c2caf76af7dfcad3), which was in turn adapted from an earlier [Twitter thread](https://twitter.com/joepie91/status/1254368447250694146).</p>

It's incredible how many collective developer hours have been wasted on pushing through the turd that is ES Modules (often mistakenly called "ES6 Modules"). Causing a big ecosystem divide and massive tooling support issues, for... well, no reason, really. There are no actual advantages to it. At all.

It looks shiny and new and some libraries use it in their documentation without any explanation, so people assume that it's the new thing that must be used. And then I end up having to explain to them why, unlike CommonJS, it doesn't actually work everywhere yet, and may never do so. For example, you [can't import ESM modules from a CommonJS file](https://github.com/sindresorhus/p-defer/issues/7)! (Update: I've released a [module](https://www.npmjs.com/package/fix-esm) that works around this issue.)

And then there's Rollup, which apparently requires ESM to be used, at least to get things like treeshaking. Which then makes people believe that treeshaking is not possible with CommonJS modules. Well, [it is](https://github.com/indutny/webpack-common-shake) - Rollup just chose not to support it.

And then there's Babel, which tried to transpile `import`/`export` to `require`/`module.exports`, sidestepping the ongoing effort of standardizing the module semantics for ESM, causing broken imports and `require("foo").default` nonsense and spec design issues all over the place.

And then people go "but you can use ESM in browsers without a build step!", apparently not realizing that that is an utterly useless feature because loading a full dependency tree over the network would be unreasonably and unavoidably slow - you'd need as many roundtrips as there are levels of depth in your dependency tree - and so you need some kind of build step anyway, eliminating this entire supposed benefit.

And then people go "well you can statically analyze it better!", apparently not realizing that ESM doesn't actually change any of the JS semantics other than the `import`/`export` syntax, and that the `import`/`export` statements are equally analyzable as top-level `require`/`module.exports`.

*"But in CommonJS you can use those elsewhere too, and that breaks static analyzers!"*, I hear you say. Well, yes, absolutely. But that is inherent in dynamic imports, which by the way, ESM also supports with its dynamic `import()` syntax. So it doesn't solve that either! Any static analyzer still needs to deal with the case of dynamic imports *somehow* - it's just rearranging deck chairs on the Titanic.

And *then*, people go "but now we at least have a standard module system!", apparently not realizing that CommonJS was *literally that*, the result of an attempt to standardize the various competing module systems in JS. Which, against all odds, *actually succeeded*!

... and then promptly got destroyed by ESM, which reintroduced a split and all sorts of incompatibility in the ecosystem, rather than just importing some updated variant of CommonJS into the language specification, which would have sidestepped almost all of these issues.

And while the initial CommonJS standardization effort succeeded due to none of the competing module systems being in particularly widespread use yet, CommonJS is so ubiquitous in Javascript-land nowadays that it will never fully go away. Which means that runtimes will forever have to keep supporting two module systems, and **developers will forever be paying the cost of the interoperability issues between them**.

### But it's the future!

Is it really? The vast majority of people who believe they're currently using ESM, aren't even actually doing so - they're feeding their entire codebase through Babel, which deftly converts all of those snazzy `import` and `export` statements back into CommonJS syntax. Which works. So what's the point of the new module system again, if it all works with CommonJS anyway?

And it gets worse; `import` and `export` are designed as special-cased statements. Aside from the obvious problem of needing to learn a special syntax (which doesn't *quite* work like object destructuring) instead of reusing core language concepts, this is also a downgrade from CommonJS' `require`, which is a *first-class expression* due to just being a function call.

That might sound irrelevant on the face of it, but it has very real consequences. For example, the following pattern is simply **not possible** with ESM:

```javascript
const someInitializedModule = require("module-name")(someOptions);
```

Or how about this one? Also no longer possible:

```javascript
const app = express();
// ...
app.use("/users", require("./routers/users"));
```

Having language features available as a first-class expression is one of the most desirable properties in language design; yet for some completely unclear reason, ESM proponents decided to *remove* that property. There's just no way anymore to directly combine an `import` statement with some other JS syntax, whether or not the module path is statically specified.

The only way around this is with `await import`, which would break the supposed static analyzer benefits, only work in async contexts, and even then require weird hacks with parentheses to make it work correctly.

It also means that you now need to make a choice: do you want to be able to use ESM-only dependencies, or do you want to have access to patterns like the above that help you keep your codebase maintainable? ESM or maintainability, your choice!

So, congratulations, ESM proponents. You've destroyed a successful userland specification, wasted many (hundreds of?) thousands of hours of collective developer time, many hours of my own personal unpaid time trying to support people with the fallout, and created ecosystem fragmentation that will never go away, in exchange for... fuck all.

This is a disaster, and the only remaining way I see to fix it is to stop trying to make ESM happen, and deprecate it in favour of some variant of CommonJS modules being absorbed into the spec. It's not too late *yet*; but at some point it will be.

# A few notes on the "Gathering weak npm credentials" article

<p class="callout info">This article was originally published in 2017 at [https://gist.github.com/joepie91/828532657d23d512d76c1e68b101f436](https://gist.github.com/joepie91/828532657d23d512d76c1e68b101f436). Since then, npm has implemented 2FA support in the registry, and was acquired by Microsoft through Github.</p>

Yesterday, [an article was released](https://github.com/ChALkeR/notes/blob/master/Gathering-weak-npm-credentials.md) that describes how one person could obtain access to enough packages on npm to affect 52% of the package installations in the Node.js ecosystem. Unfortunately, this has brought about some comments from readers that completely miss the mark, and that draw away attention from the real issue behind all this.

To be very clear: **This (security) issue was caused by 1) poor password management on the side of developers, 2) handing out unnecessary publish access to packages, and most of all 3) poor security on the side of the npm registry.**

With that being said, let's address some of the common claims. This is going to be slightly ranty, because to be honest I'm rather disappointed that otherwise competent infosec people distract from the underlying causes like this. All that's going to do is prevent this from getting fixed in *other* language package registries, which almost certainly suffer from the same issues.

### "This is what you get when you use small dependencies, because there are such long dependency chains"[<svg aria-hidden="true" class="octicon octicon-link" height="16" version="1.1" viewbox="0 0 16 16" width="16"></svg>](https://gist.github.com/joepie91/828532657d23d512d76c1e68b101f436#this-is-what-you-get-when-you-use-small-dependencies-because-there-are-such-long-dependency-chains)

This is very unlikely to be a relevant factor here. Don't forget that a key part of the problem here is that publisher access is handed out unnecessarily; if the Node.js ecosystem were to consist of a few large dependencies (that everybody used) instead of many small ones (that are only used by those who actually need the entire dependency), you'd just end up with each large dependency being responsible for *a larger part of the 52%*.

There's a potential point of discussion in that a modular ecosystem means that more different groups of people are involved in the implementation of a given dependency, and that this could provide for a larger (human) attack surface; however, *this is a completely unexplored argument for which no data currently exists*, and this particular article does not provide sufficient evidence to show it to be true.

Perhaps not surprisingly, the "it's because of small dependencies" argument seems to come primarily from people who don't fully understand the Node.js dependency model and make a lot of (incorrect) assumptions about its consequences, and who appear to take every opportunity to blame things on "small dependencies" regardless of technical accuracy.

**In short:** No, this is not because of small dependencies. It would very likely happen with large dependencies as well.

### "See, that's why you should always lock your dependency versions. This is why semantic versioning is bad."

Aside from semantic versioning being a practice that's separate from automatically updating based on a semver range, preventing automatic updates isn't going to prevent this issue either. The problem here is with *publish access to the modules*, which is a completely separate concern from "how the obtained access is misused".

In practice, most people who "lock dependency versions" seem to follow a practice of "automatically merge any update that doesn't break tests" - which really is no different from just letting semver ranges do their thing. Even if you *do* audit updates before you apply them (and let's be realistic, how many people *actually* do this for every update?), it would be trivial to subtly backdoor most of the affected packages due to their often aging and messy codebases, where one more bit of strange code doesn't really stand out.

The chances of locked dependencies preventing exploitation are close to zero. Even if you *do* audit your updates, it's relatively trivial for a competent developer to sneak a backdoor past you. At the same time, "people not applying updates" is a far bigger security issue than any issue that audit-less dependency locking might solve.

All this applies to "vendoring in dependencies", too - vendoring in a dependency is technically no different from pinning a version/hash of that dependency.

**In short:** No, dependency locking will not prevent exploitation through this vector. Unless you have a strict auditing process (which you should, but many do not), you **should not** lock dependency versions.

### "That's why you should be able to add a hash to your package.json, so that it verifies the integrity of the dependency."

This solves a completely different and almost unimportant problem. The only thing that a package hash will do is assure that everybody who installs the dependencies gets exactly the same code (for a locked set of versions). However, the npm registry *already does that* - it prevents republishing different code under an already-used version number, and even with publisher access you cannot bypass that.

Package hashes also give you absolutely zero assurances about future updates; *package hashes are not signatures*.

**In short:** This just doesn't even have anything to do with the credentials issue. It's totally unrelated.

### "See? This is why Node.js is bad."

Unfortunately plenty of people are conveniently using this article as an excuse to complain about Node.js (because that's apparently the hip thing to do?), without bothering to understand what happened. Very simply put: **this issue is not in any way specific to Node.js.** The issue here is an issue of developers with poor password policies and poor registry access controls. It just so happens that the research was done on npm.

As far as I am aware, this kind of research has not been carried out for *any* other language package registries - but many other registries appear to be similarly poorly monitored and secured, and are very likely to be subject to the exact same attack.

If you're using this as an excuse to complain about Node.js, without bothering to understand the issue well enough to realize that it's a *language-independent issue*, then perhaps you should reconsider exactly how well-informed your point of view of Node.js (or other tools, for that matter) really is. Instead, you should take this as a lesson and *prevent this from happening in other language ecosystems*.

**In short:** This has absolutely nothing to do with Node.js specifically. That's just where the research happens to be done. Take the advice and start looking at other language package registries, to ensure they are not vulnerable to this either.

### So then how should I fix this?

1. Demand from npm Inc. that they prioritize implementing 2FA immediately, actively monitor for incidents like this, and generally implement all the mitigations suggested in [the article](https://github.com/ChALkeR/notes/blob/master/Gathering-weak-npm-credentials.md#how-things-could-be-further-improved-on-the-npm-side). It's really not reasonable how poorly monitored or secured the registry is, especially given that it's *operated by a commercial organization*, and it's been around for a *long* time.
2. If you have an npm account, follow the instructions [here](https://github.com/ChALkeR/notes/blob/master/Gathering-weak-npm-credentials.md#what-users-should-do-on-this).
3. Carry out or encourage the same kind of research on the package registry for *your* favorite language. It's very likely that other package registries are similarly insecure and poorly monitored.

Unfortunately, as a mere consumer of packages, there's nothing you can do about this other than demanding that npm Inc. gets their registry security in order. This is fundamentally an infrastructure problem.

# Node.js

Things that are specific to Node.js. Note that things about Javascript in general are found under their own "Javascript" chapter!

# How to install Node.js applications, if you're not a Node.js developer

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/24f4e70174d10325a9af743a381d5ec6](https://gist.github.com/joepie91/24f4e70174d10325a9af743a381d5ec6).</p>

While installing a Node.js application isn't difficult *in principle*, it may still be confusing if you're not used to how the Node.js ecosystem works. This post will tell you how to get the application going, what to expect, and what to do if it doesn't work.

*Occasionally* an application may have custom installation steps, such as installing special system-wide dependencies; in those cases, you'll want to have a look at the install documentation of the application itself as well. However, *most of the time* it's safe to assume that the instructions below will work fine.

If the application you want to install is available in your distribution's repositories, then install it through there instead and skip this entire guide; your distribution's package manager will take care of all the dependencies.

### Checklist

Before installing a Node.js application, check the following things:

1. **You're running a maintained version of Node.js.** You can find a list of current maintained versions [here](https://github.com/nodejs/Release#release-schedule). For minimal upgrade headaches, ensure that you're running an LTS version. If your system is running an *unsupported* version, you should install Node.js [from the Nodesource repositories](https://github.com/nodesource/distributions) instead.
2. **Your version of Node.js is a standard one.** In particular, Debian and some Debian-based distributions have a habit of modifying the way Node.js works, leading to a lot of things breaking. Try running `node --version` - if that works, you're running a standard-enough version. If you can only do `nodejs --version`, you should install Node.js [from the Nodesource repositories](https://github.com/nodesource/distributions) instead.
3. **You have build tools installed.** In particular, you'll want to make sure that `make`, `pkg-config`, GCC and Python exist on your system. If you don't have build tools or you're unsure, you'll want to install a package like `build-essential` (on Linux) or [look here for further instructions](https://github.com/nodejs/node-gyp#installation) (on other platforms, or unusual Linux distributions).
4. ***npm* works.** Run `npm --version` to check this. If the `npm` command doesn't exist, your distribution is probably shipping a weird non-standard version of Node.js; use [the Nodesource repositories](https://github.com/nodesource/distributions) instead. **Do not** install npm as a separate package, this will lead to headaches down the road.

No root/administrator access, no repositories exist for your distro, can't change your system-wide Node.js version, need a really specific Node.js version to make the application work, or have some other sort of edge case? Then [nvm](https://github.com/creationix/nvm/blob/master/README.md) can be a useful solution, although keep in mind that it *will not* automatically update your Node.js installation.

### How packages work in Node.js

Packages work a little differently in Node.js from most languages and distributions. In particular, **dependencies are *not* installed system-wide**. Every project has its own (nested) set of dependencies. This solves a lot of package management problems, but it can take a little getting used to if you're used to other systems.

In practice, this means that you should almost always do a regular `npm install` - that is, installing the dependencies locally into the project. The only time you need to do a 'global installation' (using `npm install -g packagename`) is when you're installing an *application* that is *itself* published on npm, and you want it to be available globally on your system.

This also means that **you should *not* run npm as root** by default. This is a really important thing to internalize, or you'll run into trouble down the line.

To recap:

- Run npm under your own, unprivileged user - unless instructions *specifically* state that you should run it as root.
- Run npm in 'local' mode, installing dependencies into the project folder - unless instructions *specifically* state that you should do a global installation.

If you're curious about the details of packages in Node.js, [here](https://gist.github.com/joepie91/9b9dbd8c9ac3b55a65b2) is a developer-focused article about them.

### Installing an application from the npm registry

Is the application published on the npm registry, ie. does it have a page on `npmjs.org`? Great! That means that installation is a single command.

If you've installed Node.js through your distribution's package manager: `sudo npm install -g packagename`, where `packagename` is the name of the package on npm.

If you've installed Node.js through `nvm` or a similar tool: `npm install -g packagename`, where `packagename` is the name of the package on npm.

You'll notice that you need to run the command as root (eg. through `sudo`) when installing Node.js through your distribution's package manager, but not when installing it through `nvm`.

This is because by default, Node.js will use a system-wide folder for globally installed packages; but under `nvm`, your entire Node.js installation exists in a subdirectory of your unprivileged user's home directory - including the 'global packages' folder.

After following these steps, some new binaries will probably be available for you to use system-wide. If the application's documentation doesn't tell you what binaries are available, then you should find its code repository, and look at the `"bin"` key in its `package.json`; that will contain a list of all the binaries it provides. Running them with `--help` will probably give you documentation.
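
As a hypothetical illustration (the package name and file path here are made up), a package that provides a `some-tool` binary would declare it in its `package.json` like this:

```json
{
  "name": "some-tool",
  "version": "1.0.0",
  "bin": {
    "some-tool": "./bin/cli.js"
  }
}
```

Installing such a package globally makes the `some-tool` command available system-wide.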

You're done!

<p class="callout info">**If you run into a problem:** Scroll down to the 'troubleshooting' section.</p>

### Installing an application from a repository

Some applications are not published to the npm registry; instead, you're expected to install them from the code (eg. Git) repository. In those cases, start by looking at the application's install instructions to see if there are special requirements for cloning the repository, such as checking out submodules.

If there are no special instructions, then a simple `git clone http://example.com/path/to/repository` should work, replacing the URL with the cloning URL of the repository.

#### Making it available globally (like when installing from the npm registry)

Enter the cloned folder, and then run:

- If you installed Node.js from your distribution's repositories: `sudo npm install -g`, with no other arguments.
- If you installed Node.js through `nvm` or a similar tool: `npm install -g`, with no other arguments.

You're done!

<p class="callout info">**If you run into a problem:** Scroll down to the 'troubleshooting' section.</p>

#### Keeping it in the repository

Sometimes you don't really want to install the application onto your system; you just want to get it running locally from the repository.

In that case, enter the cloned folder, and run: `npm install`, with no other arguments.

You're done!

<p class="callout info">**If you run into a problem:** Scroll down to the 'troubleshooting' section.</p>

### Troubleshooting

Sometimes, things still won't work. In most cases it'll be a matter of missing some sort of undocumented external dependency, ie. a dependency that npm can't manage for you and that's typically provided by the OS. Sometimes it's a version compatibility issue. Occasionally applications are just outright broken.

When running into trouble with npm, try entering your installation output into [this tool](http://cryto.net/why-is-npm-broken/) first. It's able to (fully automatically!) recognize the most common issues that people tend to run into with npm.

If the tool can't find your issue and it still doesn't work, then drop by the IRC channel (#Node.js on Libera, an online chat can be found [here](https://web.libera.chat/)) and we'll be happy to help you get things going! You do need to register your username to talk in the channel; you can get help with this in the #libera channel.

# Getting started with Node.js

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/95ed77b71790442b7e61](https://gist.github.com/joepie91/95ed77b71790442b7e61). Some of the links in it still point to Gists that I have written; these will be moved over and relinked in due time.</p>

<p class="callout warning">Some of the suggestions on this page have become outdated, and better alternatives are available nowadays. However, the suggestions listed here *should still work today* as they did when this article was originally written. You do not *need* to update things to new approaches, and sometimes the newer approaches actually aren't better either, they can even be worse!</p>

*"How do I get started with Node?"* is a commonly heard question in #Node.js. This gist is an attempt to compile some of the answers to that question. It's a perpetual [work-in-progress](https://gist.github.com/joepie91/95ed77b71790442b7e61#future-additions-to-this-list).

And if this list didn't quite answer your questions, I'm available for [tutoring and code review](http://cryto.net/~joepie91/code-review.html)! A [donation](http://cryto.net/~joepie91/donate.html) is also welcome :)

### Setting expectations

Before you get started learning about JavaScript and Node.js, there's one very important article you need to read: [Teach Yourself Programming in Ten Years](https://web.archive.org/web/20161125182601/http://www.norvig.com/21-days.html).

Understand that **it's going to take time** to learn Node.js, just like it would take time to learn any other specialized topic - and that you're not going to learn effectively just by reading things, or following tutorials or courses. **Get out there and build things!** Experience is by far the most important part of learning, and shortcuts to this simply *do not exist*.

Avoid "bootcamps", courses, extensive books, and basically anything else that claims to teach you programming (or Node.js) in a single run. They all lie, and what they promise you simply isn't possible. That's also the reason this post is a *list of resources*, rather than a single book - they're references for when you need to learn about a certain topic at a certain point in time. Nothing more, nothing less.

There's also no such thing as a "definitive guide to Node.js", or a "perfect stack". Every project is going to have different requirements that are best solved by different tools. There's no point in trying to learn everything upfront, because *you can't know what you need to learn, until you actually need it*.

In conclusion, the best way to get started with Node.js is to simply **decide on a project you want to build, and start working on it**. Start with the simplest possible implementation of it, and over time add bits and pieces to it, learning about those bits and pieces as you go. The links in this post will help you with that.

<p class="callout info">You'll find a table of contents for this page on your left.</p>

### Javascript refresher

Especially if you normally use a different language, or you only use Javascript occasionally, it's easy to misunderstand some of the aspects of the language.

These links will help you refresh your knowledge of JS, and make sure that you understand the OOP model correctly.

- **A whirlwind tour of the language:** [http://learnxinyminutes.com/docs/javascript/](http://learnxinyminutes.com/docs/javascript/)
- **Javascript is asynchronous, through using an 'event loop'.** [This video](https://www.youtube.com/watch?v=8aGhZQkoFbQ) explains what an event loop *is*, and [this video](https://www.youtube.com/watch?v=cCOL7MC4Pl0) goes into more detail about how it works and how to deal with corner cases. If you're not familiar with the event loop yet, you should watch both.
- **Javascript does automatic typecasting ("type conversion") in some cases.** [This](https://gist.github.com/joepie91/5fd8c58345998d4dec5b) shows how various values cast to a boolean, and [this](https://gist.github.com/joepie91/b207efcfc6ace64f0f41) shows how `null` and `undefined` relate to each other.
- **In Javascript, braces are optional for single-line statements - however, you should *always* use them.** [This gist](https://gist.github.com/joepie91/203aaa8e36e1eb958fe7) demonstrates why.
- **Asynchronous execution in Javascript is normally implemented using CPS.** This stands for "continuation-passing style", and [this](https://gist.github.com/joepie91/977c966cb0d6c15690b0) shows an example of how that works.
- **However, in practice, you shouldn't use that, and you should use Promises instead.** Whereas it is very easy to mess up CPS code, that is not an issue with Promises - error handling is much more reliable, for example. [This guide](https://gist.github.com/joepie91/791640557e3e5fd80861) should give you a decent introduction.
- **A callback should be either consistently synchronous, or consistently asynchronous.** You don't really have to worry about this when you're using Promises (as they ensure that this is consistent), but [this article](http://blog.izs.me/post/59142742143/designing-apis-for-asynchrony) still has a good explanation of the reasons for this. A simpler example can be found [here](https://gist.github.com/joepie91/98576de0fab7badec167).
- **Javascript does not have classes, and constructor functions are a bad idea.** [This short article](https://hughfdjackson.com/javascript/prototypes-the-short(est-possible)-story/) will help you understand the prototypal OOP model that Javascript uses. [This gist](https://gist.github.com/joepie91/22fb5cd443517566412b) shows a brief example of what the `this` variable refers to. Often you don't need inheritance at all - [this gist](https://gist.github.com/joepie91/657f2b4b054d90aa0c87) shows an example of creating an object in the simplest possible way.
- **In Javascript, closures are everywhere, by default.** [This gist](https://gist.github.com/joepie91/cb0e42c3562bc94e4491) shows an example.
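
To make the CPS idea above concrete, here's a minimal sketch (the `add` function is hypothetical): instead of returning a value, a function passes its result to a callback, using Node's error-first convention.

```javascript
// Continuation-passing style (CPS): the function doesn't return its
// result; it passes it to a callback ("continuation") instead.
function add(a, b, callback) {
	if (typeof a !== "number" || typeof b !== "number") {
		callback(new Error("Both arguments must be numbers"));
		return;
	}
	callback(null, a + b); // first argument is the error (null = no error)
}

let sum;
add(2, 3, function (err, result) {
	if (err) throw err;
	sum = result; // sum is now 5
});
```

In real asynchronous code the callback would fire later (eg. after I/O has completed), which is exactly why this style is easy to get wrong and Promises are the better default.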
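
The "simplest possible way" of creating objects, and closures, can be sketched together in one hypothetical example (`createCounter` is made up for illustration):

```javascript
// A factory function: no classes, no `new`, no prototypes needed.
// The `count` variable lives in a closure, so it's private to each counter.
function createCounter(start) {
	let count = start;
	return {
		increment: function () { count += 1; return count; },
		current: function () { return count; }
	};
}

const counter = createCounter(10);
counter.increment(); // the internal count is now 11
```

Each call to `createCounter` produces an independent closure, so separate counters never share state.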

### The Node.js platform

Node.js is not a language. Rather, it's a "runtime" that lets you run Javascript without a browser. It comes with some basic additions - such as a TCP library, or in Node.js-speak, a "TCP module" - that you need in order to write server applications.

- **The easiest way to install Node.js on Linux and OS X, is to use `nvm`.** The instructions for that can be found [here](https://github.com/creationix/nvm/blob/master/README.markdown). Make sure you create a `default` alias (as explained in the documentation), if you want it to work like a 'normal' installation.
- **If you are using Windows:** You can download an installer from [the Node.js website](https://nodejs.org/en/). You should consider using a different operating system, though - Windows is generally rather poorly suited for software development outside of .NET. Things will be a lot easier if you use Linux or OS X.
- **The package manager you'll use for Node.js is called NPM.** While it's very simple to use, it's not particularly well-documented. [This article](https://gist.github.com/joepie91/9b9dbd8c9ac3b55a65b2) will give you an introduction to it.
- **Don't hesitate to add dependencies, even small ones!** Node.js and NPM are specifically designed to make this possible without running into issues, and you will get big benefits from doing so. [This post](https://github.com/sindresorhus/ama/issues/10#issuecomment-117766328) explains more about that.
- **The module system is very simple.** [The Node.js documentation explains this further.](https://nodejs.org/api/modules.html)
- **MongoDB is commonly recommended and used with Node.js. It is, however, extremely poorly designed - and you shouldn't use it.** [This article](http://cryto.net/~joepie91/blog/2015/07/19/why-you-should-never-ever-ever-use-mongodb/) goes into more detail about *why* you shouldn't use it. If you're not sure what to use, use [PostgreSQL](http://www.postgresql.org/).
- The rest of the documentation for all the modules included with Node.js can be found [here](https://nodejs.org/api/).

### Setting up your environment

- **To be able to install "native addons" (compiled C++ modules), you need to take some additional steps.** If you are on Linux or OS X, you likely already have everything you need - however, on Windows you'll have to install a few additional pieces of software. The instructions for all of these platforms can be found [here](https://github.com/nodejs/node-gyp#installation). **Do not skip this step.** Installing pure-Javascript modules is not always a viable solution, especially where it concerns cryptography-related modules such as `scrypt` or `bcrypt`.
- **If you're running into issues on Windows,** try [these instructions](https://github.com/Microsoft/nodejs-guidelines/blob/master/windows-environment.md#compiling-native-addon-modules) from Microsoft.
- **There are a lot of build tools for helping you manage your code.** It can get a bit confusing, though - there are a lot of articles that just tell you to combine a pile of different tools, without ever explaining what they're for. [This](https://gist.github.com/joepie91/3381ce7f92dec7a1e622538980c0c43d) is a hype-free overview of different kinds of build tools, and what they may be useful for.

### Functional programming

Javascript has part of its roots in functional programming languages, which means that you can use some of those concepts in your own projects. They can be greatly beneficial to the readability and maintainability of your code.

- [This article](http://cryto.net/~joepie91/blog/2015/05/04/functional-programming-in-javascript-map-filter-reduce/) gives an introduction to `map`, `filter` and `reduce` - three functional programming operations that help a *lot* in writing maintainable and predictable code.
- [This gist](https://gist.github.com/joepie91/34742045a40f7c48430e) shows an example of using those with Bluebird, the Promises library that I recommended in the Promises Reading Guide.
- [This slide deck](http://slides.com/gsklee/functional-programming-in-5-minutes) demonstrates currying in Javascript, another functional programming technique - think of them as "partially executed functions".
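
As a quick, self-contained illustration of these techniques (the data here is made up):

```javascript
// map/filter/reduce: sum the doubles of the even numbers.
const numbers = [1, 2, 3, 4, 5, 6];
const total = numbers
	.filter(function (n) { return n % 2 === 0; })      // keep evens: [2, 4, 6]
	.map(function (n) { return n * 2; })               // double them: [4, 8, 12]
	.reduce(function (sum, n) { return sum + n; }, 0); // sum them: 24

// Currying: a "partially executed function" that remembers its first argument.
function multiply(a) {
	return function (b) { return a * b; };
}
const double = multiply(2);
const doubled = double(21); // 42
```

Note how each step in the chain is a small, predictable transformation; no loop counters or mutable accumulator variables are needed.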

### Module patterns

To build "configurable" modules, you can use a pattern known as "parametric modules". [This gist](https://gist.github.com/joepie91/83a8e03ad931e696df22) shows an example of that. [This](https://gist.github.com/joepie91/9b1ca2392a72e82b44fb) is another example.

A commonly used pattern is the `EventEmitter` - this is exactly what it sounds like: an object that emits events. It's a very simple abstraction, but helps greatly in writing [loosely coupled](https://en.wikipedia.org/wiki/Loose_coupling) code. [This gist](https://gist.github.com/joepie91/82df4eff6956089e3fbf) illustrates the object, and the full documentation can be found [here](https://nodejs.org/api/events.html).

### Code architecture

The 'design' of your codebase matters a lot. Certain approaches for solving a problem work better than other approaches, and each approach has its own set of benefits and drawbacks. Picking the right approach is important - it will save you hours (or days!) of time down the line, when you are maintaining your code.

I'm still in the process of writing more about this, but so far, I've already written an article that explains the difference between monolithic and modular code and why it matters. You can read it [here](https://gist.github.com/joepie91/7f03a733a3a72d2396d6).

### Express

If you want to build a website or web application, you'll probably find [Express](http://expressjs.com/) to be a good framework to start with. As a framework, it is *very* small. It only provides you with the basic necessities - everything else is a plugin.

If this sounds complicated, don't worry - things almost always work "out of the box". Simply follow the `README` for whichever "middleware" (Express plugin) you want to add.

To get started with Express, simply follow the below articles. Whatever you do, don't use the Express Generator - it generates confusing and bloated code. Just start from scratch and follow the guides!

- [Installing Express](http://expressjs.com/en/starter/installing.html) (some of this was already covered in the NPM guide above)
- [A Hello World example](http://expressjs.com/en/starter/hello-world.html)
- [Routing](http://expressjs.com/en/guide/routing.html)
- [Using template engines](http://expressjs.com/en/guide/using-template-engines.html)
- [Writing Middleware](http://expressjs.com/en/guide/writing-middleware.html)
- [Using Middleware](http://expressjs.com/en/guide/using-middleware.html)
- [Static File Handling](http://expressjs.com/en/starter/static-files.html) (this is middleware, too!)
- [Error Handling](http://expressjs.com/en/guide/error-handling.html)
- [Debugging](http://expressjs.com/en/guide/debugging.html)
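
Middleware in Express are just functions taking `(req, res, next)`. As a sketch, here's a hypothetical request-timing middleware, invoked with plain objects instead of a real Express app so the example is self-contained:

```javascript
// An Express-style middleware: do some work on the request,
// then hand control to the next middleware in the chain.
function requestTimer(req, res, next) {
	req.startedAt = Date.now();
	next();
}

// In a real application you'd register it with app.use(requestTimer).
// Here we simulate what Express does: call it with a request object.
const req = {};
let nextCalled = false;
requestTimer(req, {}, function next() {
	nextCalled = true;
});
```

Because middleware are ordinary functions, they're easy to unit-test this way, without starting a server.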

To get a better handle on how to render pages server-side with Express:

- [Rendering pages server-side with Express (and Pug)](https://gist.github.com/joepie91/c0069ab0e0da40cc7b54b8c2203befe1) (a step-by-step walkthrough, work in progress)

Some more odds and ends about Express:

- [Some FAQs](http://expressjs.com/en/starter/faq.html) (don't use MVC, however - [this is why](http://aredridel.dinhe.net/2015/01/30/why-mvc-does-not-fit-the-web/).)
- [Express Behind Proxies](http://expressjs.com/en/guide/behind-proxies.html)
- [The full Express API documentation](http://expressjs.com/en/4x/api.html)

Some examples:

- [Making something "globally available" in an Express application](https://gist.github.com/joepie91/a8270b0b7a1a433032a2)
- [Writing configurable middleware](https://gist.github.com/joepie91/7f531cc7fa7245e68cc8) (using the same technique as the parametric modules I showed earlier)

Combining Express and Promises:

- [A short article explaining how to use `express-promise-router`](http://cryto.net/~joepie91/blog/2015/05/14/using-promises-bluebird-with-express/)
- [An example](https://gist.github.com/joepie91/e4cd0f2c84ea2f303bb2), also explaining what would happen if you didn't handle errors.

Some common Express middleware that you might want to use:

- **Sessions:** [express-session](https://www.npmjs.com/package/express-session), with [connect-session-knex](https://www.npmjs.com/package/connect-session-knex) if you are using Knex.
- **Message flashing:** [connect-flash](https://www.npmjs.com/package/connect-flash)
- **Handling request payloads ("form/POST data"):** [body-parser](https://www.npmjs.com/package/body-parser)
- **Handling uploads and other multipart data:** [multer](https://www.npmjs.com/package/multer) if you want it written to disk like PHP would do, or [connect-busboy](https://www.npmjs.com/package/connect-busboy) if you want to interact with the upload stream directly.
- **Access logs:** [morgan](https://www.npmjs.com/package/morgan)
- **OAuth/OpenID integration:** [Passport](http://passportjs.org/)

### Coming from other languages or platforms

- **If you are used to PHP or similar:** Contrary to PHP, Node.js does *not* use a CGI-like model (ie. "one pageload is one script"). Instead, it is a persistent process - your code *is* the webserver, and it handles many incoming requests at the same time, for as long as the process keeps running. This means you can have persistent state - [this gist](https://gist.github.com/joepie91/bf0813626e6568e8633b) shows an example of that.
- **If you are used to synchronous platforms:** [This gist](https://gist.github.com/joepie91/dc67316d2a22f321d1a1) illustrates the differences between a (synchronous) PHP script and an (asynchronous) Node.js application.
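The persistent-state difference can be sketched with a plain counter. Because the Node.js process keeps running between requests, module-level variables survive across them - something impossible in PHP's one-script-per-request model. This is a minimal, framework-free illustration of the idea (in a real application, `handleRequest` would be a route handler):

```javascript
// Module-level state: initialized ONCE when the process starts,
// then shared across every request the process handles.
let visitCount = 0;

// Stands in for a route handler; a plain function keeps the idea
// self-contained.
function handleRequest() {
  visitCount += 1;
  return `You are visitor number ${visitCount}`;
}

console.log(handleRequest()); // first "request"
console.log(handleRequest()); // second "request" sees the updated state
```

In PHP, `$visitCount` would be thrown away after every response; here it lives as long as the process does.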

### Security

Note that this advice isn't necessarily complete. It answers some of the most common questions, but your project might have special requirements or caveats. When in doubt, you can always ask in the #Node.js channel!

Also, keep in mind the golden rule of security: humans *suck* at repetition, regardless of their level of competence. **If a mistake *can* be made, then it *will* be made.** Design your systems such that they are hard to use incorrectly.

- **Sessions:** Use something that implements session cookies. If you're using Express, [express-session](https://www.npmjs.com/package/express-session) will take care of this for you. Whatever you do, **don't use JWT for sessions**, even if many blog posts recommend it - it will cause security problems. [This article](http://cryto.net/~joepie91/blog/2016/06/13/stop-using-jwt-for-sessions/) goes into more detail.
- **Password hashing:** Use `scrypt`. [This wrapper module](https://www.npmjs.com/package/scrypt-for-humans) will make it easier to use.
- **CSRF protection:** You need this if you are building a website. Use [csurf](https://www.npmjs.com/package/csurf).
- **XSS:** Every good templater will escape output by default. **Only** use templaters that do this (such as [Pug](https://pugjs.org/), formerly known as Jade, or [Nunjucks](https://mozilla.github.io/nunjucks/))! If you need to explicitly escape things, you should consider it insecure - it's too easy to forget to do this, and it's practically guaranteed to result in vulnerabilities.
- **SQL injection:** Always use parameterized queries. When using MySQL, use the `node-mysql2` module instead of the `node-mysql` module - the latter doesn't use real parameterized queries. Ideally, use something like [Knex](http://knexjs.org/), which will also prevent many other issues, and make your queries much more readable and maintainable.
- **Random numbers and values:** Generating unpredictable random numbers is a lot harder than it seems. `Math.random()` will generate numbers that may *seem* random, but are actually quite predictable to an attacker. If you need random values, read [this article](https://gist.github.com/joepie91/7105003c3b26e65efcea63f3db82dfba) for recommendations. It also goes into more detail about the types of "randomness" that exist.
- **Cryptography:** Follow the suggestions in [this gist](https://gist.github.com/tqbf/be58d2d39690c3b366ad). Whatever you do, **do not use the `crypto` module directly**, unless you really have no other choice. Never use pure-Javascript reimplementations - always use bindings to the original implementation, where possible (in the form of native addons).
- **Vulnerability advisories:** The Node Security Project keeps track of [known vulnerabilities](https://nodesecurity.io/advisories) in Node.js modules. Services like [VersionEye](https://www.versioneye.com/) will e-mail you if your project uses a module that is found to be vulnerable.

### Useful modules

This is an incomplete list, and I'll probably be adding stuff to it in the future.

- **Determining the type of a value:** [type-of-is](https://www.npmjs.com/package/type-of-is)
- **Date/time handling:** [Moment.js](http://momentjs.com/)
- **Making HTTP requests:** [bhttp](https://www.npmjs.com/package/bhttp)
- **Clean debugging logs:** [debug](https://www.npmjs.com/package/debug)
- **Cleaner stacktraces and errors:** [pretty-error](https://www.npmjs.com/package/pretty-error)
- **Markdown parsing:** [marked](https://www.npmjs.com/package/marked)
- **HTML parsing:** [cheerio](https://www.npmjs.com/package/cheerio) (has a jQuery-like API)
- **WebSockets:** [ws](https://www.npmjs.com/package/ws)

### Deployment

- **Don't run Node.js as root, ever!** If you want to expose your service at a privileged port (eg. port 80), and you probably do, then you can use [authbind](https://thomashunter.name/blog/using-authbind-with-node-js/) to accomplish that safely.

### Distribution

- **Your project is ready for release!** But... you should still pick a license. [This article](http://cryto.net/~joepie91/blog/2013/03/21/licensing-for-beginners/) will give you a very basic introduction to copyright, and the different kinds of (common) licenses you can use.

### Scalability

**Scalability is a result of your application architecture, not the technologies you pick.** Be wary of anything that claims to be "scalable" - it's much more important to write loosely coupled code with small components, so that you can split out responsibilities across multiple processes and servers.

### Troubleshooting

Is something not working properly? Here are some resources that might help:

- Is `npm install` causing an error? Use [this error explaining tool](http://cryto.net/why-is-npm-broken/) to find out what's wrong.
- ``DeprecationWarning: Using Buffer without `new` will soon stop working.`` - the solution for this can be found [here](https://gist.github.com/joepie91/a0848a06b4733d8c95c95236d16765aa).

### Optimization

The first rule of optimization is: **do not optimize.**

The correct order of concerns is security first, then maintainability/readability, and *then* performance. Optimizing performance is something you shouldn't care about until you have *hard metrics* showing you that it is needed. If you can't show a performance problem in numbers, it doesn't exist; and while it is easy to optimize readable code, it's much harder to make optimized code more readable.

There is one exception to this rule: *never* use any methods that end with `Sync` - these are blocking, synchronous methods, and will block your event loop (ie. your entire application) until they have completed. They may look convenient, but they are not worth the performance penalty.
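You can observe this blocking behaviour directly: while synchronous code runs, nothing else gets a turn - not even a zero-delay timer. A small stdlib-only demonstration, with a busy-wait standing in for a slow `*Sync` call:

```javascript
const start = Date.now();

// This timer is scheduled to fire "immediately"...
setTimeout(() => {
  console.log(`timer fired after ${Date.now() - start} ms`);
}, 0);

// ...but synchronous code (here a busy-wait standing in for a slow
// *Sync call) blocks the event loop, so the timer - and every other
// pending request - must wait until it finishes.
while (Date.now() - start < 100) { /* blocking the event loop */ }

const blockedFor = Date.now() - start;
console.log(`event loop was blocked for ${blockedFor} ms`);
```

In a webserver, that 100 ms of blocking means *every* concurrent request stalls, not just the one that triggered the synchronous call.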

Now let's say that you *are* having performance issues. Here are some articles and videos to learn more about how optimization and profiling works in Node.js / V8 - they are going to be fairly in-depth, so you may want to hold off on reading these until you've gotten some practice with Node.js:

- [Common causes of deoptimization](https://github.com/petkaantonov/bluebird/wiki/Optimization-killers)
- [Monomorphism, and why it is important](http://mrale.ph/blog/2015/01/11/whats-up-with-monomorphism.html)
- [Tuning Node.js](https://www.youtube.com/watch?v=FXyM1yrtloc)
- [A tour of V8: object representation](http://jayconrod.com/posts/52/a-tour-of-v8-object-representation)
- [Node.js in flames](http://techblog.netflix.com/2014/11/nodejs-in-flames.html)
- [Realtime Node.js App: A Stress Testing Story (using Socket.IO)](https://bocoup.com/weblog/node-stress-test-analysis)
- A bigger list of resources about V8 optimization and internals can be found [here](http://mrale.ph/v8/resources.html).

If you're seeing memory leaks, then these may be helpful articles to read:

- [Three kinds of memory leaks](https://blog.nelhage.com/post/three-kinds-of-leaks/)

These are some modules that you may find useful for profiling your application:

- **[node-inspector](https://www.npmjs.com/package/node-inspector):** Based on Chrome Developer Tools, this tool gives you many features, including CPU and heap profiling. Also useful for debugging in general. **Since Node.js v6.3.0, you can also [connect directly](https://www.reddit.com/r/node/comments/4yoane/in_case_you_have_missed_it_node_v630_came_out/) using Chrome Developer Tools.**
- **[heapdump](https://www.npmjs.com/package/heapdump):** On-demand heap dumps, for later analysis. Usable from application code *in production*, so very useful for making a heap dump the moment your application goes over a certain heap size.
- **[memwatch-next](https://www.npmjs.com/package/memwatch-next):** Provides memory leak detection, and heap diffing.

### Writing C++ addons

You'll usually want to avoid this - C++ is not a memory-safe language, so it's much safer to just write your code in Javascript. V8 is rather well-optimized, so in most cases, performance isn't a problem either. That said, sometimes - eg. when writing bindings to something else - you just *have* to write a native module.

These are some resources on that:

- [The addon documentation](https://nodejs.org/api/addons.html)
- [`nan`, an abstraction layer for making your module work across Node.js versions](https://www.npmjs.com/package/nan) (you should absolutely use this)
- [`node-gyp`, the build tool you will need for this purpose](https://github.com/nodejs/node-gyp)
- [V8 API documentation for every supported Node.js version](http://v8dox.com/)

### Writing Rust addons

Neon is a new project that lets you write **memory-safe compiled extensions** for Node.js, using Rust. It's still pretty new, but quite promising - an introduction can be found [here](http://calculist.org/blog/2015/12/23/neon-node-rust/).

### Odds and ends

Some miscellaneous code snippets and examples that I haven't written a section or article for yet.

- **Named logging in Gulp:** [https://gist.github.com/joepie91/e7d66ffdb17d1ea69c56](https://gist.github.com/joepie91/e7d66ffdb17d1ea69c56)
- **Cached image:** [https://gist.github.com/joepie91/cee42198b6bc6a24ea44](https://gist.github.com/joepie91/cee42198b6bc6a24ea44)
- **Combining Gulp and Electron:** [https://gist.github.com/joepie91/f81cdbc1b45d52ab4b87](https://gist.github.com/joepie91/f81cdbc1b45d52ab4b87)

### Future additions to this list

There are a few things that I'm currently working on documenting, that will be added to this list in the future. I write new documentation as I find the time to do so.

- **Node.js for PHP developers** (a migration guide) - In progress.
- **A comprehensive guide to Promises** - Planned.
- **A comprehensive guide to streams** - Planned.
- **Error handling mechanisms and strategies** - Planned.
- **Introduction to HTTP** - Planned.
- **Writing a secure authentication system** - Planned.
- **Writing abstractions** - Planned.

# Node.js for PHP developers

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/87c5b93a5facb4f99d7b2a65f08363db](https://gist.github.com/joepie91/87c5b93a5facb4f99d7b2a65f08363db). It has not been finished yet, but still contains some useful pointers.</p>

## Learning a second language

If PHP was your first language, and this is the first time you're looking to learn another language, you may be tempted to try and "make it work like it worked in PHP". While understandable, this is a **really bad idea**. Different languages have fundamentally different designs, with different best practices, different syntax, and so on. The result of this is that different languages are also better for different usecases.

By trying to make one language work like the other, you get the **worst of both worlds** - you lose the benefits that made the first language good for your usecase, and add the design flaws of the second. You should always aim to learn a language *properly*, including how it is commonly or optimally used. Your code is going to look and feel considerably different, and that's okay!

Over time, you will gain a better understanding of how different language designs carry different tradeoffs, and you'll be able to get the *best* of both worlds. This will take time, however, and you should always start by learning and using each language *as it is* first, to gain a full understanding of it.

One thing I explicitly recommend against, is [CGI-Node](http://www.cgi-node.org/) - you should **never, ever, ever use this**. It makes a lot of grandiose claims, but it actually just reimplements some of the worst and most insecure parts of PHP in Node.js. It is also completely unnecessary - the sections below will go into more detail.

## Execution model

The "execution model" of a language describes how your code is executed. In the case of a web-based application, it determines how your server goes from "an HTTP request is coming in", to "the application code is executed", to "a response has been sent".

PHP uses what we'll call the "CGI model" to run your code - for every HTTP request that comes in, the webserver (usually Apache or nginx) will look in your "document root" for a `.php` file with the same path and filename, and then execute that file. This means that for every new request, it effectively starts a new PHP process, with a "clean slate" as far as application state is concerned. Other than `$_SESSION` variables, all the variables in your PHP script are thrown away after a response is sent.

This "CGI model" is a somewhat unique execution model, and only a few technologies use it - PHP, ASP and ColdFusion are the most well-known. It's also a very fragile and limited model that makes it easy to introduce security issues; for example, "uploading a shell" is something that's only possible because of the CGI model.

Node.js, however, uses a different model: the "long-running process" model. In this model, your code is not executed *by* a webserver - rather, your code *is* the webserver. Your application is only started once, and once it has started, it will handle an essentially unlimited number of requests, potentially hundreds or thousands at the same time. Almost every other language uses this same model.

This also means that your application state *continues to exist* after a response has been sent, and this makes a lot of projects much easier to implement, because you don't need to constantly store every little thing in a database; instead, you only need to store things in your database that you actually intend to store for a long time.

Some of the advantages of the "long-running process" model (as compared to the "CGI model"):

- You can share information between requests *without* having to store it in an external database or the session data.
- There is a lot less overhead per request, and you can handle more concurrent requests on the same server.
- You can continue doing work *after* having sent a response to the client, and there is no time limit.
- You can easily implement something that needs a long-running connection, such as applications that are based on WebSockets.
- It's not possible for an attacker to "upload a shell".

The reason attackers cannot upload a shell is that there is no direct mapping between a URL and a location on your filesystem. Your application is *explicitly* designed to only execute specific files that are a part of your application. When you try to access a `.js` file that somebody uploaded, it will just send the `.js` file; it won't be executed.

There aren't really any disadvantages - while you do have to have a Node.js process running at all times, it can be managed in the same way as any other webserver. You can also use another webserver in front of it; for example, if you want to host multiple domains on a single server.
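The "explicit mapping" idea can be sketched as a route table: the application only ever runs handlers it registered itself, and any other URL - including one pointing at an uploaded file - simply gets a 404. This is a simplified illustration of what frameworks like Express do for you, not their actual implementation:

```javascript
// Routes are an explicit whitelist: URL -> handler function.
// Nothing outside this table can ever be executed.
const routes = {
  "/": () => "homepage",
  "/about": () => "about page",
};

function dispatch(url) {
  const handler = routes[url];
  // Unknown URLs never reach the filesystem, let alone get executed.
  return handler ? handler() : "404 Not Found";
}

console.log(dispatch("/"));                // "homepage"
console.log(dispatch("/uploads/evil.js")); // "404 Not Found" - never executed
```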

## Hosting

Node.js applications will not run in most shared hosting environments, as they are designed to *only* run PHP. While there are some 'managed hosting' environments like Heroku that claim to work similarly, they are usually rather expensive and not really worth the money.

When deploying a Node.js project in production, you will most likely want to host it on a VPS or a dedicated server. These are full-blown Linux systems that you have full control over, so you can run any application or database that you want. The cheapest option here is to go with an "unmanaged provider".

Unmanaged providers are providers whose responsibility ends at the server and the network - they make sure that the system is up and running, and from that point on it's your responsibility to manage your applications. Because they do not provide support for your projects, they are a lot cheaper than "managed providers".

My usual recommendations for unmanaged providers are (in no particular order): [RamNode](https://ramnode.com/), [Afterburst](http://afterburst.com/), [SecureDragon](https://securedragon.net/), [Hostigation](http://hostigation.com/) and [RAM Host](http://ramhost.us/). Another popular choice is [DigitalOcean](https://www.digitalocean.com/) - but while their service is stable and sufficient for most people, I personally don't find the performance/resources/price ratio to be good enough. I've also heard good things about [Linode](http://linode.com/), but I don't personally use them - they do, however, apparently provide limited support for your server management.

As explained in the previous section, your application *is* the webserver. However, there are some reasons you might still want to run a "generic" webserver in front of your application:

- Easier setup of TLS ("SSL").
- Multiple applications for different domains, on the same server ("virtual hosts").
- Slightly faster static file serving.

My recommendation for this is [Caddy](https://caddyserver.com/). While nginx is a popular and often-recommended option, it's considerably harder to set up than Caddy, especially for TLS.

## Frameworks

(this section is a work in progress, these are just some notes left for myself)

- execution model
- Express
- small modules

## Templating

If you've already used a templater like Smarty in PHP, here's the short version: use either [Pug](https://pugjs.org/) or [Nunjucks](https://mozilla.github.io/nunjucks/), depending on your preference. Both auto-escape values by default, but I strongly recommend Pug - it understands the actual structure of your template, which gives you more flexibility.

If you've been using `include()` or `require()` in PHP along with inline `<?php echo($foobar); ?>` statements, here's the long version:

The "using-PHP-as-a-templater" approach is quite flawed - it makes it very easy to introduce security issues such as [XSS](http://excess-xss.com/) by accidentally forgetting to escape something. I won't go into detail here, but suffice to say that this is a serious risk, *regardless* of how competent you are as a developer. Instead, you should be using a templater **that auto-escapes values by default, unless you explicitly tell it not to**. [Pug](https://pugjs.org/) and [Nunjucks](https://mozilla.github.io/nunjucks/) are two options in Node.js that do precisely that, and both will work with Express out of the box.
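To see why manual escaping is so failure-prone: *every single* interpolation point must be escaped, and one forgotten call is enough for an XSS hole. Below is a minimal sketch of the escaping that an auto-escaping templater applies to every interpolated value for you - it's illustrative only (real templaters also handle attribute and URL contexts), so use a real templater, not this:

```javascript
// What an auto-escaping templater does to *every* interpolated value
// by default (simplified). Ampersands must be escaped first, so the
// other replacements don't get double-escaped.
function escapeHtml(value) {
  return String(value)
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;");
}

const userInput = '<script>alert("xss")</script>';
console.log(escapeHtml(userInput));
// -> &lt;script&gt;alert(&quot;xss&quot;)&lt;/script&gt;
```

With a templater like Pug or Nunjucks, this happens automatically on every interpolation, so forgetting it is not possible in the first place.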

# Rendering pages server-side with Express (and Pug)

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/c0069ab0e0da40cc7b54b8c2203befe1](https://gist.github.com/joepie91/c0069ab0e0da40cc7b54b8c2203befe1).</p>

### Terminology

- **View:** Also called a "template", a file that contains markup (like HTML) and optionally additional instructions on how to generate snippets of HTML, such as text interpolation, loops, conditionals, includes, and so on.
- **View engine:** Also called a "template library" or "templater", ie. a library that implements view functionality, and potentially also a custom language for specifying it (like Pug does).
- **HTML templater:** A template library that's designed specifically for generating HTML. It understands document structure and thus can provide useful advanced tools like mixins, as well as more secure output escaping (since it can determine the right escaping approach from the context in which a value is used), but it also means that the templater is not useful for anything other than HTML.
- **String-based templater:** A template library that implements templating logic, but that has no understanding of the content it is generating - it simply concatenates together strings, potentially multiple copies of those strings with different values being used in them. These templaters offer a more limited feature set, but are more widely usable.
- **Text interpolation / String interpolation:** The insertion of variable values into a string of some kind. Typical examples include ES6 template strings, or this example in Pug: `Hello #{user.username}!`
- **Locals:** The variables that are passed into a template, to be used in rendering that template. These are generally specified every time you wish to render a template.

Pug is an example of an HTML templater. Nunjucks is an example of a string-based templater. React could technically be considered an HTML templater, although it's not really designed to be used primarily server-side.

### View engine setup

Assuming you'll be using Pug, this is simply a matter of installing Pug...

```
npm install --save pug
```

... and then configuring Express to use it:

```javascript
let app = express();

app.set("view engine", "pug");

/* ... rest of the application goes here ... */
```

You won't need to `require()` Pug anywhere; Express will do this internally.

You'll likely want to explicitly set the directory where your templates will be stored, as well:

```javascript
let app = express();

app.set("view engine", "pug");
app.set("views", path.join(__dirname, "views"));

/* ... rest of the application goes here ... */
```

This will make Express look for your templates in the "views" directory, relative to the file in which you specified the above line.

### Rendering a page

**homepage.pug:**

```
html
    body
        h1 Hello World!
        p Nothing to see here.
```

**app.js:**

```javascript
router.get("/", (req, res) => {
    res.render("homepage");
});
```

Express will automatically add the configured extension to the template name. That means that - with our Express configuration - the `"homepage"` template name in the above example will point at `views/homepage.pug`.

### Rendering a page with locals

**homepage.pug:**

```
html
    body
        h1 Hello World!
        p Hi there, #{user.username}!
```

**app.js:**

```javascript
router.get("/", (req, res) => {
    res.render("homepage", {
        user: req.user
    });
});
```

In this example, the `#{user.username}` bit is an example of string interpolation. The "locals" are just an object containing values that the template can use. Since every expression in Pug is written in JavaScript, you can pass *any* kind of valid JS value into the locals, including functions (that you can call from the template).

For example, we could do the following as well - although **there's no good reason to do this**, so this is for illustrative purposes only:

**homepage.pug:**

```
html
    body
        h1 Hello World!
        p Hi there, #{getUsername()}!
```

**app.js:**

```javascript
router.get("/", (req, res) => {
    res.render("homepage", {
        getUsername: function() {
            return req.user.username;
        }
    });
});
```

### Using conditionals

**homepage.pug:**

```
html
    body
        h1 Hello World!

        if user != null
            p Hi there, #{user.username}!
        else
            p Hi there, unknown person!
```

**app.js:**

```javascript
router.get("/", (req, res) => {
    res.render("homepage", {
        user: req.user
    });
});
```

Again, the expression in the conditional is just a JS expression. All defined locals are accessible and usable as before.

### Using loops

**homepage.pug:**

```
html
    body
        h1 Hello World!

        if user != null
            p Hi there, #{user.username}!
        else
            p Hi there, unknown person!

        p Have some vegetables:

        ul
            for vegetable in vegetables
                li= vegetable
```

**app.js:**

```javascript
router.get("/", (req, res) => {
    res.render("homepage", {
        user: req.user,
        vegetables: [
            "carrot",
            "potato",
            "beet"
        ]
    });
});
```

Note that this...

```
li= vegetable
```

... is just shorthand for this:

```
li #{vegetable}
```

By default, the contents of a tag are assumed to be a string, optionally with interpolation in one or more places. By suffixing the tag name with `=`, you indicate that the contents of that tag should be a *JavaScript expression* instead.

That expression may just be a variable name as well, but it doesn't *have* to be - any JS expression is valid. For example, this is completely okay:

```
li= "foo" + "bar"
```

And this is completely valid as well, *as long as a `randomVegetable` function is defined in the locals*:

```
li= randomVegetable()
```

### Request-wide locals

Sometimes, you want to make a variable available in *every* `res.render` for a request, no matter what route or middleware the page is being rendered from. A typical example is the user object for the current user. This can be accomplished by setting it as a property on the `res.locals` object.

**homepage.pug:**

```
html
    body
        h1 Hello World!

        if user != null
            p Hi there, #{user.username}!
        else
            p Hi there, unknown person!

        p Have some vegetables:

        ul
            for vegetable in vegetables
                li= vegetable
```

**app.js:**

```javascript
app.use((req, res, next) => {
    res.locals.user = req.user;
    next();
});

/* ... more code goes here ... */

router.get("/", (req, res) => {
    res.render("homepage", {
        vegetables: [
            "carrot",
            "potato",
            "beet"
        ]
    });
});
```

### Application-wide locals

Sometimes, a value even needs to be *application-wide* - a typical example would be the site name for a self-hosted application, or other application configuration that doesn't change for each request. This works similarly to `res.locals`, only now you set it on `app.locals`.

**homepage.pug:**

```
html
    body
        h1 Hello World, this is #{siteName}!

        if user != null
            p Hi there, #{user.username}!
        else
            p Hi there, unknown person!

        p Have some vegetables:

        ul
            for vegetable in vegetables
                li= vegetable
```

**app.js:**

```javascript
app.locals.siteName = "Vegetable World";

/* ... more code goes here ... */

app.use((req, res, next) => {
    res.locals.user = req.user;
    next();
});

/* ... more code goes here ... */

router.get("/", (req, res) => {
    res.render("homepage", {
        vegetables: [
            "carrot",
            "potato",
            "beet"
        ]
    });
});
```

The order of specificity is as follows: `app.locals` are overwritten by `res.locals` of the same name, and `res.locals` are overwritten by `res.render` locals of the same name.

In other words: if we did something like this...

```javascript
router.get("/", (req, res) => {
    res.render("homepage", {
        siteName: "Totally Not Vegetable World",
        vegetables: [
            "carrot",
            "potato",
            "beet"
        ]
    });
});
```

... then the homepage would show "Totally Not Vegetable World" as the website name, while every *other* page on the site still shows "Vegetable World".

### Rendering a page after asynchronous operations

**homepage.pug:**

```
html
    body
        h1 Hello World, this is #{siteName}!

        if user != null
            p Hi there, #{user.username}!
        else
            p Hi there, unknown person!

        p Have some vegetables:

        ul
            for vegetable in vegetables
                li= vegetable
```

**app.js:**

```javascript
app.locals.siteName = "Vegetable World";

/* ... more code goes here ... */

app.use((req, res, next) => {
    res.locals.user = req.user;
    next();
});

/* ... more code goes here ... */

router.get("/", (req, res) => {
    return Promise.try(() => {
        return db("vegetables").limit(3);
    }).map((row) => {
        return row.name;
    }).then((vegetables) => {
        res.render("homepage", {
            vegetables: vegetables
        });
    });
});
```

Basically the same as when you use `res.send`, only now you're using `res.render`.

### Template inheritance in Pug

It would be very impractical if you had to define the *entire* site layout in every individual template - not only that, but the duplication would also result in bugs over time. To solve this problem, Pug (and most other templaters) support *template inheritance*. An example is below.

**layout.pug:**

```
html
    body
        h1 Hello World, this is #{siteName}!

        if user != null
            p Hi there, #{user.username}!
        else
            p Hi there, unknown person!

        block content
            p This page doesn't have any content yet.
```

**homepage.pug:**

```
extends layout

block content
    p Have some vegetables:

    ul
        for vegetable in vegetables
            li= vegetable
```

**app.js:**

```javascript
app.locals.siteName = "Vegetable World";

/* ... more code goes here ... */

app.use((req, res, next) => {
    res.locals.user = req.user;
    next();
});

/* ... more code goes here ... */

router.get("/", (req, res) => {
    return Promise.try(() => {
        return db("vegetables").limit(3);
    }).map((row) => {
        return row.name;
    }).then((vegetables) => {
        res.render("homepage", {
            vegetables: vegetables
        });
    });
});
```

That's basically all there is to it. You define a `block` in the base template - optionally with default content, as we've done here - and then each template that "extends" (inherits from) that base template can *override* such `block`s. Note that you never render `layout.pug` directly - you still render the page layouts themselves, and they just inherit from the base template.

Things of note:

- Overriding a `block` is *optional*. If you don't override a `block`, it will simply contain either the default content from the base template (if any is specified), or no content at all (if not).
- You can have an unlimited number of `block`s with different names - for example, the one in our example is called `content`. You can decide to override any of them from a template, all of them, or none at all. It's up to you.
- You can nest multiple `block`s with different names. This can be useful for more complex layout variations.
- You can have multiple levels of inheritance - any template you are inheriting from can itself inherit from another template. This can be especially useful in combination with nested `block`s, for complex cases.
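The last two points can be combined. Below is a hypothetical sketch of a three-level setup - the `two-column.pug` template and the `sidebar`/`main` block names are made up for illustration:

**layout.pug:**

```
html
    body
        block content
            p This page doesn't have any content yet.
```

**two-column.pug:**

```
extends layout

block content
    div.sidebar
        block sidebar
            p This page doesn't have a sidebar yet.
    div.main
        block main
```

**homepage.pug:**

```
extends two-column

block sidebar
    p Some sidebar links could go here.

block main
    p And the page content goes here.
```

Here, `homepage.pug` never touches the `content` block directly - it only fills in the smaller `sidebar` and `main` blocks that `two-column.pug` carved out of it.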

### Static files

You'll probably also want to serve static files on your site, whether they are CSS files, images, downloads, or anything else. By default, Express ships with `express.static`, which does this for you.

All you need to do is tell Express where to look for static files. You'll usually want to put `express.static` at the very start of your middleware definitions, so that no time is wasted on eg. initializing sessions when a request for a static file comes in.

```javascript
let app = express();

app.set("view engine", "pug");
app.set("views", path.join(__dirname, "views"));

app.use(express.static(path.join(__dirname, "public")));

/* ... rest of the application goes here ... */
```

Your directory structure might look like this:

```
your-project
|- node_modules ...
|- public
|  |- style.css
|  `- logo.png
|- views
|  |- homepage.pug
|  `- layout.pug
`- app.js

```

In the above example, `express.static` will look in the `public` directory for static files, relative to the `app.js` file. For example, if you tried to access `https://your-project.com/style.css`, it would send the user the contents of `your-project/public/style.css`.

You can optionally also specify a *prefix* for static files, just like for any other Express middleware:

```javascript
let app = express();

app.set("view engine", "pug");
app.set("views", path.join(__dirname, "views"));

app.use("/static", express.static(path.join(__dirname, "public")));

/* ... rest of the application goes here ... */
```

Now, that same `your-project/public/style.css` can be accessed through `https://your-project.com/static/style.css` instead.

An example of using it in your **layout.pug**:

```
html
    head
        link(rel="stylesheet", href="/static/style.css")
    body
        h1 Hello World, this is #{siteName}!

        if user != null
            p Hi there, #{user.username}!
        else
            p Hi there, unknown person!

        block content
            p This page doesn't have any content yet.
```

The slash at the start of `/static/style.css` is important - it tells the browser to ask for it *relative to the domain*, as opposed to *relative to the page URL*.

An example of URL resolution without a leading slash:

- **Page URL:** `https://your-project.com/some/deeply/nested/page`
- **Stylesheet URL:** `static/style.css`
- **Resulting stylesheet request URL:** `https://your-project.com/some/deeply/nested/static/style.css`

An example of URL resolution *with* the leading slash:

- **Page URL:** `https://your-project.com/some/deeply/nested/page`
- **Stylesheet URL:** `/static/style.css`
- **Resulting stylesheet request URL:** `https://your-project.com/static/style.css`

That's it! You do the same thing to embed images, scripts, link to downloads, and so on.
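For example, embedding the `logo.png` from the directory structure above in a template is just another `/static` URL:

```
img(src="/static/logo.png", alt="Site logo")
```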

# Running a Node.js application using nvm as a systemd service

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/73ce30dd258296bd24af23e9c5f761aa](https://gist.github.com/joepie91/73ce30dd258296bd24af23e9c5f761aa).</p>

<p class="callout warning">Hi there! Since this post was originally written, `nvm` has gained some new tools, and some people have suggested alternative (and potentially better) approaches for modern systems. Make sure to have a look at the comments on the [original Gist](https://gist.github.com/joepie91/73ce30dd258296bd24af23e9c5f761aa), *before* following this guide!</p>

Trickier than it seems.

### 1. Set up nvm

Let's assume that you've already created an unprivileged user named `myapp`. You should never run your Node.js applications as root!

Switch to the `myapp` user, and do the following:

1. `curl -o- https://raw.githubusercontent.com/creationix/nvm/v0.31.0/install.sh | bash` (however, this will immediately run the nvm installer - you probably want to just download the `install.sh` manually, and inspect it before running it)
2. Install the latest stable Node.js version: `nvm install stable`

### 2. Prepare your application

Your package.json must specify a `start` script that describes what to execute for your application. For example:

```javascript
...
"scripts": {
    "start": "node app.js"
},
...
```

### 3. Service file

Save this as `/etc/systemd/system/my-application.service`:

```ini
[Unit]
Description=My Application

[Service]
EnvironmentFile=-/etc/default/my-application
ExecStart=/home/myapp/start.sh
WorkingDirectory=/home/myapp/my-application-directory
LimitNOFILE=4096
IgnoreSIGPIPE=false
KillMode=process
User=myapp

[Install]
WantedBy=multi-user.target

```

You'll want to change the `User`, `Description` and `ExecStart`/`WorkingDirectory` paths to reflect your application setup.

### 4. Startup script

Next, save this as `/home/myapp/start.sh` (adjusting the username in both the path *and* the script if necessary):

```bash
#!/bin/bash
. /home/myapp/.nvm/nvm.sh
npm start
```

This script is necessary because nvm is implemented as a shell function, which means it can't be loaded from the service file directly.

Make sure to make it executable:

```bash
chmod +x /home/myapp/start.sh
```

### 5. Enable and start your service

Replace `my-application` below with whatever you've named your service file, and run the following **as root**:

1. `systemctl enable my-application`
2. `systemctl start my-application`

To verify whether your application started successfully (don't forget to `npm install` your dependencies!), run:

```bash
systemctl status my-application
```

... which will show you the last few lines of its output, whether it's currently running, and any errors that might have occurred.

Done!

# Persistent state in Node.js

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/bf0813626e6568e8633b](https://gist.github.com/joepie91/bf0813626e6568e8633b).</p>

This is an extremely simple example of how you have 'persistent state' when writing an application in Node.js. The `i` variable is shared across all requests, so every time the `/increment` route is accessed, the number is incremented and returned.

This may seem obvious, but it works quite differently from eg. PHP, where each HTTP request is effectively a 'clean slate', and you don't have persistent state. Were this written in PHP, then every request would have returned `1`, rather than an incrementing number.

```javascript
var i = 0;

// [...]

app.get("/increment", function(req, res) {
      i += 1;
      res.send("Current number: " + i);
})

// [...]
```

# node-gyp requirements

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/375f6d9b415213cf4394b5ba3ae266ae](https://gist.github.com/joepie91/375f6d9b415213cf4394b5ba3ae266ae). It may no longer be applicable.</p>

### Linux

- Python 2.7 (not 3.x!), `build-essential` (make, gcc, etc.)

### Windows

- As Administrator: `npm install --global --production windows-build-tools`

### OS X

- Old OS X: [http://osxdaily.com/2012/07/06/install-gcc-without-xcode-in-mac-os-x/](http://osxdaily.com/2012/07/06/install-gcc-without-xcode-in-mac-os-x/)
- New OS X: [http://osxdaily.com/2014/02/12/install-command-line-tools-mac-os-x/](http://osxdaily.com/2014/02/12/install-command-line-tools-mac-os-x/)

# Introduction to sessions

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/cf5fd6481a31477b12dc33af453f9a1d](https://gist.github.com/joepie91/cf5fd6481a31477b12dc33af453f9a1d).</p>

*While a lot of Node.js guides recommend using JWT as an alternative to session cookies (sometimes even mistakenly calling it "more secure than cookies"), this is a terrible idea. JWTs are absolutely **not** a secure way to deal with user authentication/sessions, and [this article](http://cryto.net/~joepie91/blog/2016/06/13/stop-using-jwt-for-sessions/) goes into more detail about that.*

Secure user authentication requires the use of *session cookies*.

*Cookies* are small key/value pairs that are usually sent by a server, and stored on the client (often a browser). The client then sends this key/value pair back with every request, in a HTTP header. This way, unique clients can be identified between requests, and client-side settings can be stored and used by the server.

*Session cookies* are cookies containing a unique *session ID* that is generated by the server. This session ID is used by the server to identify the client whenever it makes a request, and to associate *session data* with that request.

*Session data* is arbitrary data that is stored on the server side, and that is associated with a session ID. The client can't see or modify this data, but the server can use the session ID from a request to associate session data with that request.

Altogether, this allows for the server to store arbitrary data for a session (that the user can't see or touch!), that it can use on every subsequent request in that session. This is how a website remembers that you've logged in.

Step-by-step, the process goes something like this:

1. **Client** requests login page.
2. **Server** sends login page HTML.
3. **Client** fills in the login form, and submits it.
4. **Server** receives the data from the login form, and verifies that the username and password are correct.
5. **Server** creates a new session in the database, containing the ID of the user in the database, and generates a unique session ID for it (which is *not* the same as the user ID!)
6. **Server** sends the session ID to the user as a cookie header, alongside a "welcome" page.
7. **Client** receives the session ID, and saves it locally as a cookie.
8. **Client** displays the "welcome" page that the cookie came with.
9. **User** clicks a link on the welcome page, navigating to their "notifications" page.
10. **Client** retrieves the session cookie from storage.
11. **Client** requests the notifications page, sending along the session cookie (containing the session ID).
12. **Server** receives the request.
13. **Server** looks at the session cookie, and extracts the session ID.
14. **Server** retrieves the session data from the database, for the session ID that it received.
15. **Server** associates the session data (containing the user ID) with the request, and passes it on to something that handles the request.
16. **Server request handler** receives the request (containing the session data including user ID), and sends a personalized notifications page for the user with that ID.
17. **Client** receives the personalized notifications page, and displays it.
18. **User** clicks another link, and we go back to step 10.
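On the wire, this cookie exchange is just a pair of HTTP headers: the server sets the cookie with a `Set-Cookie` header (step 6), and the client sends it back with a `Cookie` header on every subsequent request (step 11). A rough sketch, with a made-up cookie name and session ID:

```
HTTP/1.1 200 OK
Set-Cookie: session=wvp9an3wfvkp; Path=/; HttpOnly

GET /notifications HTTP/1.1
Cookie: session=wvp9an3wfvkp
```

The `HttpOnly` flag keeps the cookie out of reach of client-side JavaScript, which is what you want for session cookies.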

### Configuring sessions

Thankfully, you won't have to implement all this yourself - most of it is done for you by existing session implementations. If you're using Express, that implementation would be [express-session](https://github.com/expressjs/session).

The `express-session` module doesn't implement the actual session storage itself; it only handles the Express-related bits - for example, it ensures that `req.session` is automatically loaded for each request, and saved back afterwards.

For the storage of session data, you need to specify a "session store" that's specific to the database you want to use for your session data - and when using Knex, `connect-session-knex` is the best option for that.

While full documentation is available in the `express-session` repository, this is what your `express-session` initialization might look like when you're using a relational database like PostgreSQL (through [Knex](http://knexjs.org/)):

```javascript
const express = require("express");
const knex = require("knex");
const expressSession = require("express-session");
const KnexSessionStore = require("connect-session-knex")(expressSession);

const config = require("./config.json");

/* ... other code ... */

/* You will probably already have a line that looks something like the below.
 * You won't have to create a new Knex instance for dealing with sessions - you
 * can just use the one you already have, and the Knex initialization here is
 * purely for illustrative purposes. */
let db = knex(require("./knexfile"));

let app = express();

/* ... other app initialization code ... */

app.use(expressSession({
    secret: config.sessions.secret,
    resave: false,
    saveUninitialized: false,
    store: new KnexSessionStore({
        knex: db
    })
}));

/* ... rest of the application goes here ... */
```

#### The configuration example in more detail

```javascript
require("connect-session-knex")(expressSession)
```

The `connect-session-knex` module needs access to the `express-session` library, so instead of exporting the session store constructor directly, it exports a *wrapper function*. We call that wrapper function immediately after requiring the module, passing in the `express-session` module, and we get back a session store constructor.

```javascript
app.use(expressSession({
    secret: config.sessions.secret,
    resave: false,
    saveUninitialized: false,
    store: new KnexSessionStore({
        knex: db
    })
}));
```

This is where we 1) create a new `express-session` middleware, and 2) `app.use` it, so that it processes every request, attaching session data where needed.

```javascript
secret: config.sessions.secret,
```

Every application should have a "secret" for sessions - essentially a secret key that will be used to cryptographically sign the session cookie, so that the user can't tamper with it. This should be a *random* value, and it should be stored in a configuration file. You should *not* store this value (or any other secret values) in the source code directly.

On Linux and OS X, a quick way to generate a [securely random](https://gist.github.com/joepie91/7105003c3b26e65efcea63f3db82dfba) key is the following command: `cat /dev/urandom | env LC_CTYPE=C tr -dc _A-Za-z0-9 | head -c${1:-64}`

```javascript
resave: false,
```

When `resave` is set to `true`, `express-session` will *always* save the session data after every request, regardless of whether the session data was modified. This can cause race conditions, and therefore you usually don't want to do this, but with some session stores it's necessary as they don't let you reset the "expiry timer" without saving all the session data again.

`connect-session-knex` doesn't have this problem, and so you should set it to `false`, which is the safer option. If you intend to use a different session store, you should consult the `express-session` documentation for more details about this option.

```javascript
saveUninitialized: false,
```

If the user doesn't have a session yet, a brand new `req.session` object is created for them on their first request. This setting determines whether that session should be saved to the database, *even* if no session data was stored into it. Setting it to `false` makes it so that the session is only saved if it's actually *used* for something, and that's the setting you want here.

```javascript
store: new KnexSessionStore({
    knex: db
})
```

This tells `express-session` where to store the actual session data. In the case of `connect-session-knex` (which is where `KnexSessionStore` comes from), we need to pass in an existing Knex instance, which it will then use for interacting with the `sessions` table. Other options can be found in the [`connect-session-knex` documentation](https://www.npmjs.com/package/connect-session-knex).

### Using sessions

The usage of sessions is quite simple - you simply set properties on `req.session`, and you can then access those properties from other requests within the same session. For example, this is what a login route might look like (assuming you're using Knex, [`scrypt-for-humans`](https://www.npmjs.com/package/scrypt-for-humans), and a custom `AuthenticationError` created with [`create-error`](https://www.npmjs.com/package/create-error)):

```javascript
router.post("/login", (req, res) => {
    return Promise.try(() => {
        return db("users").where({
            username: req.body.username
        });
    }).then((users) => {
        if (users.length === 0) {
            throw new AuthenticationError("No such username exists");
        } else {
            let user = users[0];

            return Promise.try(() => {
                return scryptForHumans.verifyHash(req.body.password, user.hash);
            }).then(() => {
                /* Password was correct */
                req.session.userId = user.id;
                res.redirect("/dashboard");
            }).catch(scryptForHumans.PasswordError, (err) => {
                throw new AuthenticationError("Invalid password");
            });
        }
    });
});
```

And your `/dashboard` route might look like this:

```javascript
router.get("/dashboard", (req, res) => {
    return Promise.try(() => {
        if (req.session.userId == null) {
            /* User is not logged in */
            res.redirect("/login");
        } else {
            return Promise.try(() => {
                return db("users").where({
                    id: req.session.userId
                });
            }).then((users) => {
                if (users.length === 0) {
                    /* User no longer exists */
                    req.session.destroy();
                    res.redirect("/login");
                } else {
                    res.render("dashboard", {
                        user: users[0]
                    });
                }
            });
        }
    });
});
```

In this example, `req.session.destroy()` will - like the name suggests - destroy the session, essentially returning the user to a session-less state. In practice, this means they get "logged out".
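A logout route can therefore be very short. The `/logout` path and `handleLogout` name below are made up for illustration, but the pattern is the same as in the other routes:

```javascript
// Hypothetical logout handler: destroy the session, then send the
// (now logged-out) user back to the login page.
function handleLogout(req, res) {
    req.session.destroy();
    res.redirect("/login");
}

/* Registered like any other route:
 * router.get("/logout", handleLogout); */
```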

Now, if you had to do all that logic for *every* route that requires the user to be logged in, it would get rather unwieldy. So let's move it out into some middleware:

```javascript
function requireLogin(req, res, next) {
    return Promise.try(() => {
        if (req.session.userId == null) {
            /* User is not logged in */
            res.redirect("/login");
        } else {
            return Promise.try(() => {
                return db("users").where({
                    id: req.session.userId
                });
            }).then((users) => {
                if (users.length === 0) {
                    /* User no longer exists */
                    req.session.destroy();
                    res.redirect("/login");
                } else {
                    req.user = users[0];
                    next();
                }
            });
        }
    });
}

router.get("/dashboard", requireLogin, (req, res) => {
    res.render("dashboard", {
        user: req.user
    });
});
```

Note the following:

- We now have a separate `requireLogin` function that verifies whether the user is logged in.
- That same function also sets `req.user` if they *are* logged in, with their user data, before calling `next()` (which passes control to the next middleware/route).
- Instead of only specifying a path and a route in the `router.get` call, we now specify our `requireLogin` middleware as well. It will get called before the route, and the route is *only* ever called if the `requireLogin` middleware calls `next()` (which it only does for logged-in users).

# Secure random values

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/7105003c3b26e65efcea63f3db82dfba](https://gist.github.com/joepie91/7105003c3b26e65efcea63f3db82dfba).</p>

Not all random values are created equal - for security-related code, you need a *specific kind* of random value.

A summary of this article, if you don't want to read the entire thing:

- **Don't use `Math.random()`.** There are *extremely* few cases where `Math.random()` is the right answer. Don't use it, unless you've read this *entire* article, and determined that it's necessary for your case.
- **Don't use `crypto.randomBytes` directly.** While it's a CSPRNG, it's easy to bias the result when 'transforming' it, such that the output becomes more predictable.
- **If you want to generate random tokens or API keys:** Use [`uuid`](https://www.npmjs.com/package/uuid), specifically the `uuid.v4()` method. Avoid `node-uuid` - it's not the same package, and doesn't produce reliably secure random values.
- **If you want to generate random numbers in a range:** Use [`random-number-csprng`](https://www.npmjs.com/package/random-number-csprng).

You should seriously consider reading the entire article, though - it's not *that* long :)

### Types of "random"

There exist roughly three types of "random":

- **Truly random:** Exactly as the name describes. True randomness, to which no pattern or algorithm applies. It's debatable whether this really exists.
- **Unpredictable:** Not *truly* random, but impossible for an attacker to predict. This is what you need for security-related code - it doesn't matter *how* the data is generated, as long as it can't be guessed.
- **Irregular:** This is what most people think of when they think of "random". An example is a game with a background of a star field, where each star is drawn in a "random" position on the screen. This isn't truly random, and it isn't even unpredictable - it just doesn't *look* like there's a pattern to it, visually.

*Irregular* data is fast to generate, but utterly worthless for security purposes - even if it doesn't seem like there's a pattern, there is almost always a way for an attacker to predict what the values are going to be. The only realistic usecase for irregular data is things that are represented visually, such as game elements or randomly generated phrases on a joke site.

*Unpredictable* data is a bit slower to generate, but still fast enough for most cases, and it's sufficiently hard to guess that it will be attacker-resistant. Unpredictable data is provided by what's called a **CSPRNG**.

### Types of RNGs (Random Number Generators)

- **CSPRNG:** A *Cryptographically Secure Pseudo-Random Number Generator*. This is what produces *unpredictable* data that you need for security purposes.
- **PRNG:** A *Pseudo-Random Number Generator*. This is a broader category that includes CSPRNGs *and* generators that just return irregular values - in other words, you *cannot* rely on a PRNG to provide you with unpredictable values.
- **RNG:** A *Random Number Generator*. The meaning of this term depends on the context. Most people use it as an even *broader* category that includes PRNGs and *truly* random number generators.

Every random value that you need for security-related purposes (ie. anything where there exists the possibility of an "attacker"), should be generated using a **CSPRNG**. This includes verification tokens, reset tokens, lottery numbers, API keys, generated passwords, encryption keys, and so on, and so on.

### Bias

In Node.js, the most widely available CSPRNG is the `crypto.randomBytes` function, but *you shouldn't use this directly*, as it's easy to mess up and "bias" your random values - that is, making it more likely that a specific value or set of values is picked.

A common example of this mistake is using the `%` modulo operator when you have fewer than 256 possible values (since a single byte has 256 possible values). Doing so actually makes lower values *more likely* to be picked than higher values.

For example, let's say that you have 36 possible random values - `0-9` plus every lowercase letter in `a-z`. A naive implementation might look something like this:

```javascript
let randomCharacter = randomByte % 36;
```

**That code is broken and insecure.** With the code above, you essentially create the following ranges (all inclusive):

- **0-35** stays 0-35.
- **36-71** becomes 0-35.
- **72-107** becomes 0-35.
- **108-143** becomes 0-35.
- **144-179** becomes 0-35.
- **180-215** becomes 0-35.
- **216-251** becomes 0-35.
- **252-255** becomes *0-3*.

If you look at the above list of ranges, you'll notice that while each `randomCharacter` value between 4 and 35 (inclusive) can be produced by **7 possible byte values**, each value between 0 and 3 (inclusive) can be produced by **8 possible byte values**. This means that while each value between 4 and 35 has a **2.73% chance** (7 in 256) of being picked, each value between 0 and 3 has a **3.13% chance** (8 in 256).

This kind of difference may *look* small, but it's an easy and effective way for an attacker to reduce the amount of guesses they need when bruteforcing something. And this is only *one* way in which you can make your random values insecure, despite them originally coming from a secure random source.

### So, how do I obtain random values securely?

In Node.js:

- **If you need a sequence of random bytes:** Use [`crypto.randomBytes`](https://nodejs.org/dist/latest-v18.x/docs/api/crypto.html#cryptorandombytessize-callback).
- **If you need individual random numbers in a certain range:** Use [`crypto.randomInt`](https://nodejs.org/dist/latest-v18.x/docs/api/crypto.html#cryptorandomintmin-max-callback).
- **If you need a random string:** You have two good options here, depending on your needs. 
    1. Use a v4 UUID. Safe ways to generate this are [`crypto.randomUUID`](https://nodejs.org/dist/latest-v18.x/docs/api/crypto.html#cryptorandomuuidoptions), and [the `uuid` library](https://www.npmjs.com/package/uuid) (only the v4 variant!).
    2. Use a nanoid, using the [`nanoid` library](https://www.npmjs.com/package/nanoid). This also allows specifying a custom alphabet to use for your random string.

Both of these use a CSPRNG, and 'transform' the bytes in an unbiased (ie. secure) way.

In the browser:

- When using the Node.js options, your bundler *should* automatically select equivalently safe browser implementations for all of these.
- If not using a bundler: 
    - **If you need a sequence of random bytes:** Use [`crypto.getRandomValues`](https://developer.mozilla.org/en-US/docs/Web/API/Crypto/getRandomValues) with a `Uint8Array`. Other array types will get you numbers in different ranges.
    - **If you need a random string:** You have two good options here, depending on your needs. 
        1. Use a v4 UUID, with the [`crypto.randomUUID`](https://developer.mozilla.org/en-US/docs/Web/API/Crypto/randomUUID) method.
        2. Use a nanoid, using the **standalone build** of the [`nanoid` library](https://github.com/ai/nanoid#install). This also allows specifying a custom alphabet to use for your random string.

However, it is **strongly** recommended that you use a bundler, in general.

# Checking file existence asynchronously

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/bbf495e044da043de2ba](https://gist.github.com/joepie91/bbf495e044da043de2ba).</p>

Checking whether a file exists before doing something with it can lead to race conditions in your application. Race conditions are extremely hard to debug and, depending on where they occur, they can lead to **data loss or security holes**. Using the synchronous versions will **not** fix this.

Generally, just do what you want to do, and handle the error if it doesn't work. This is much safer.

- **If you want to check whether a file exists, before reading it:** just try to open the file, and handle the `ENOENT` error when it doesn't exist.
- **If you want to make sure a file doesn't exist, before writing to it:** open the file using an [exclusive mode](https://nodejs.org/api/fs.html#fs_fs_open_path_flags_mode_callback), eg. `wx` or `ax`, and handle the error when the file already exists.
- **If you want to create a directory:** just try to create it, and handle the error if it already exists.
- **If you want to remove a file or directory:** just try to [unlink](https://nodejs.org/api/fs.html#fs_fs_unlink_path_callback) the path, and handle the error if it doesn't exist.

If you're *really, really sure* that you need to use `fs.exists` or `fs.stat`, then you can use the example code below to do so asynchronously. If you just want to know how to promisify an asynchronous callback that doesn't follow the nodeback convention, then you can look at the example below as well.

<p class="callout danger">You should almost never actually use the code below. The same applies to `fs.stat` (when used for checking existence). Make sure you have read the text above first!  
</p>

```javascript
const fs = require("fs");

// `fs.exists` is deprecated, and its callback doesn't follow the
// error-first "nodeback" convention - it only receives a boolean -
// so it has to be promisified by hand.
function existsAsync(path) {
  return new Promise(function(resolve) {
    fs.exists(path, function(exists) {
      resolve(exists);
    });
  });
}
```

# Fixing "Buffer without new" deprecation warnings

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/a0848a06b4733d8c95c95236d16765aa](https://gist.github.com/joepie91/a0848a06b4733d8c95c95236d16765aa). Newer Node.js versions no longer behave in this exact way, but the information is kept here for posterity. If you have code that still uses `new Buffer`, you should still update it.</p>

If you're using Node.js, you might run into a warning like this:

```
DeprecationWarning: Using Buffer without `new` will soon stop working.
```

The reason for this warning is that the Buffer creation API was changed to require the use of `new`. However, contrary to what the warning says, you should *not* use `new Buffer` either, [for security reasons](https://github.com/ChALkeR/notes/blob/master/Buffer-knows-everything.md). Any usage of it must be converted *as soon as possible* to [`Buffer.from`, `Buffer.alloc`, or `Buffer.allocUnsafe`](https://nodejs.org/api/buffer.html#buffer_buffer_from_buffer_alloc_and_buffer_allocunsafe), depending on what it's being used for. Not changing it could mean a **security vulnerability** in your code.
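As a quick reference, the safe replacements look like this:

```javascript
// Old, deprecated (and potentially unsafe) usage:
//   new Buffer("hello")    -> copies a string
//   new Buffer([1, 2, 3])  -> copies an array of bytes
//   new Buffer(16)         -> allocates 16 bytes (UNSAFE: may expose old memory contents)

// New, safe equivalents:
const fromString = Buffer.from("hello", "utf8");
const fromArray = Buffer.from([1, 2, 3]);
const zeroFilled = Buffer.alloc(16);          // initialized to zeroes
const uninitialized = Buffer.allocUnsafe(16); // fast, but you MUST fully overwrite it yourself
```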

### Where is it coming from?

Unfortunately, the warning doesn't indicate *where* the issue comes from. If you've verified that *your own code* doesn't use `Buffer` without `new` anymore, but you're still getting the warning, then you are probably using an (outdated) dependency that still uses the old API.

The following command (for Linux and Cygwin) will list all the affected modules:

```bash
grep -rP '(?<!new |[a-zA-Z])Buffer\(' node_modules | grep "\.js" | grep -Eo '^(node_modules/[^/:]+/)*' | sort | uniq -c | sort -h
```

If you're on OS X, your `sort` tool will not have the `-h` flag. Therefore, you'll want to run this instead (but the result won't be sorted by frequency):

```bash
grep -rP '(?<!new |[a-zA-Z])Buffer\(' node_modules | grep "\.js" | grep -Eo '^(node_modules/[^/:]+/)*' | sort | uniq -c | sort
```

### How do I fix it?

If the issue is in your own code, [this documentation](https://nodejs.org/api/buffer.html#buffer_buffer_from_buffer_alloc_and_buffer_allocunsafe) will explain how to migrate. If you're targeting older Node.js versions, you may want to use the [`safe-buffer` shim](https://www.npmjs.com/package/safe-buffer) to maintain compatibility.

If the issue is in a third-party library:

1. Run `npm ls <package name here>` to determine where in your dependency tree it is installed, and look at the top-most dependency (that isn't your project itself) that it originates from.
2. If that top-most dependency is out of date, try updating the dependency first, to see if the warning goes away.
3. If the dependency is *up-to-date*, that means it's an unfixed issue in the dependency. You should create an issue ticket (or, even better, a pull request) on the dependency's repository, asking for it to be fixed.

# Why you shouldn't use Sails.js

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/cc8b0c9723cc2164660e](https://gist.github.com/joepie91/cc8b0c9723cc2164660e).</p>

<p class="callout warning">This article was published in 2015. Since then, the situation may have changed, and this article is kept for posterity. You should verify whether the issues still apply when making a decision.</p>

A large list of reasons why to avoid Sails.js and Waterline: [https://kev.inburke.com/kevin/dont-use-sails-or-waterline/](https://kev.inburke.com/kevin/dont-use-sails-or-waterline/)

Furthermore, the CEO of Balderdash, the company behind Sails.js, stated the following:

> > "we promise to push a fix within 60 days",
> 
> @kevinburkeshyp This would amount to a Service Level Agreement with the entire world; this is generally not possible, and does not exist in any software project that I know of.

Upon notifying him in the thread that I actually offer [exactly that guarantee](http://cryto.net/~joepie91/), and that his statement was thus incorrect, he accused me of "starting a flamewar", and proceeded to [delete my posts](https://github.com/balderdashy/sails/issues/2830).

<p class="callout warning">**UPDATE:** The issue has been [reopened](https://github.com/balderdashy/sails/issues/2830#issuecomment-140794914) by the founder of Balderdash. Mind that this article was written back when this was not the case yet, and judge appropriately.</p>

He is apparently also unaware that Google Project Zero expects the exact same - a hard deadline of 90 days, after which an issue is publicly disclosed.

Now, just locking the thread would have been at least somewhat justifiable - he might have legitimately misconstrued my statement as inciting a flamewar.

What is **not** excusable, however, is removing my posts that show his (negligent) statement is wrong. This raises serious questions about what the Sails maintainers consider more important: their reputation, or the actual security of their users.

It would have been perfectly possible to just leave the posts intact - the thread would be locked, so a flamewar would not have been a possibility, and each reader could make up their own mind about the state of things.

In short: **Avoid Sails.js. They do not have your best interests at heart, and this could result in serious security issues for your project.**

For reference, the full thread is below, pre-deletion.

[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/ByRFQt3LL5tDIoVf-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/ByRFQt3LL5tDIoVf-image.png)

# Building desktop applications with Node.js

### Option 1: Electron

This is the most popular and well-supported option. Electron is a combination of Node.js and Chromium Embedded Framework, and so it will give you access to the feature sets of both. The main tradeoff is that it doesn't give you much direct control over the window or the system integration.

#### Benefits

- Cross-platform
- Well-supported, with a large developer base and a lot of (third-party) documentation
- Works pretty much out of the box, and lets you use HTML and CSS
- Can use native Node.js modules

#### Drawbacks

- Relatively high baseline memory usage; expect 50-100MB of RAM before running any application code. This is fine for most applications, but probably not for tiny utilities.
- Somewhat restrictive; does not give you much control over the system integration, instead has a default setup that's okay for most purposes and abstracts away platform-specific things for the most part.
- Limited OpenGL support; only WebGL is available.

### Option 2: SDL

Using [https://www.npmjs.com/package/@kmamal/sdl](https://www.npmjs.com/package/@kmamal/sdl) and [https://www.npmjs.com/package/@kmamal/gl](https://www.npmjs.com/package/@kmamal/gl), you can use SDL and OpenGL directly from Node.js. This will take care of window creation, input handling, and so on - but you will have to do all the drawing yourself using shaders.

A full (low-level) example is available [here](https://github.com/kmamal/node-sdl/blob/master/examples/07-webgl-drawing/index.js), and you can also [use regl](https://github.com/kmamal/node-sdl/blob/master/examples/08-webgl-regl/index.js) to simplify things a bit.

For text rendering, you may wish to use Pango or HarfBuzz, which can both be used through the [node-gtk](https://github.com/romgrk/node-gtk) library (which, despite the name, is a generic GObject Introspection library rather than anything specific to the GTK UI toolkit).

#### Benefits

- Direct OpenGL access
- Does not enforce any particular structure on your project
- Good selection of [examples](https://github.com/kmamal/node-sdl/tree/master/examples)

#### Drawbacks

- You have to do all of the drawing yourself; there are no widgets, there is no CSS, and so on. You will be writing OpenGL shaders. There is support for canvas-style drawing, but it is not fast.
- More research required to understand how to use it; not a lot of people use these libraries, and there are not very many tutorials.

### Option 3: FFI bindings

You can also use an existing UI library that's written in C, C++ or Rust, by using a generic FFI library that lets you call the necessary functions from Javascript code in Node.js directly.

For C, a good option is [Koffi](https://koffi.dev/), which has excellent documentation. For Rust, a good option is [Neon](https://neon-rs.dev/), whose documentation is not quite as extensive as that of Koffi, but still pretty okay.

### Option 4: GTK

The aforementioned [node-gtk](https://github.com/romgrk/node-gtk) library can also be used to use GTK directly. Very little documentation is available about this, so you'll likely be stuck reading the GTK documentation (for its C API) and mentally translating to what the equivalent in the bindings would be.

# lmdb-js Quick Reference

Abbreviated documentation for [https://www.npmjs.com/package/lmdb](https://www.npmjs.com/package/lmdb), for easier reference once you already understand how the library works.

```javascript
"use strict";

const lmdb = require("lmdb");

// recommended write strategy: conditional writes

// if path contains . it's a file, otherwise it's a directory, or if null it's in-memory
let db = lmdb.open({ path: "/path/to/db", ... options, ... rootOptions }); // root database
let subDB = db.openDB("sub-db name", ... options);

let options = {
	compression: boolean || { threshold: integer, dictionary: Buffer },
	useVersions: boolean, // entries have version numbers
	sharedStructuresKey: Symbol, // stores values more efficiently by having a central key/structure mapping
	encoding: "msgpack" (default) || "json" || "cbor" || "string" || "ordered-binary" || "binary",
	encoder: object_settings || msgpack: { structuredClone, useFloat32 } || { encode: Function, decode: Function },
	cache: boolean || object_weakLRUCacheSettings, // if enabled, child transactions and rollbacks will not be available
	keyEncoding: "uint32" || "binary" || "ordered-binary" (default),
	keyEncoder: Function,
	dupSort: boolean, // keys have multiple values; use encoding=ordered-binary and getValues(), ifVersion will not be available
	strictAsyncOrder: boolean
};

let rootOptions = {
	path: string,
	maxDbs: integer, // default: 12
	maxReaders: integer,
	overlappingSync: boolean,
	separateFlushed: boolean,
	pageSize: integer, // set 4096 (default) for fits-in-memory, 8192 for larger especially for range queries
	eventTurnBatching: boolean, // default: true
	txnStartThreshold: integer, // only relevant when eventTurnBatching=false
	encryptionKey: Buffer || string, // 32 bytes
	commitDelay: integer, // in ms
};

// Existence (always synchronous)
let exists = db.doesExist(key);
let exists = db.doesExist(key, version); // for single-value
let exists = db.doesExist(key, value); // for multi-value

// Reads (always synchronous)
let value = db.get(key, options); // value = undefined || single value
let entry = db.getEntry(key, options); // for single-value; entry = undefined || { value, version }
let iterator = db.getValues(key); // for multi-value; iterator<value> (see range/search for special forms)
let version = db.getLastVersion(); // version = integer; version of last `get` call. not available when `cache` is enabled

// Specialized reads
let values = await db.getMany(keys); // optimized db.get that prefetches first to not block main thread
let valueEncoded = db.getBinary(key); // skip value decode

// Range/search (always synchronous)
let iterator = db.getRange(rangeOptions); // iterator<{ key, value }>, has lazy and optionally async map/filter/forEach
let iterator = db.getKeys(rangeOptions); // iterator<value>; key only returned once for multi-value entries
let iterator = db.getValues(key, rangeOptions); // for multi-value; returns all values for a key (start/end affect values, not keys)

let rangeOptions = {
	start: value,
	end: value,
	reverse: boolean,
	offset: integer,
	limit: integer,
	asArray: boolean // greedy!
};

// Mutations
let success = await db.put(key, value, version, ifVersion); // success = true if stored, false on ifVersion mismatch
let success = await db.remove(key, ifVersion); // for single-value; success = true if deleted, false on ifVersion mismatch
let success = await db.remove(key, value); // for multi-value; success = true if value deleted

// Conditionals
let success = await db.ifVersion(key, ifVersion, () => { ... });
let success = await db.ifNoExists(key, () => { ... });

// Synchronous versions of mutations
let success = db.putSync(key, value, versionOrOptions); // SLOW; versionOrOptions = { append, appendDup, noOverwrite, noDupData, version }
let success = db.removeSync(key, value); // SLOW
let success = db.removeSync(key, ifVersion); // SLOW

// Database-wide mutations
await db.clearAsync(); // removes all entries
db.clearSync(); // same but synchronous
await db.dropAsync(); // remove all entries *and* deletes the database
db.dropSync(); // same but synchronous

// Utilities
lmdb.asBinary(buffer); // Mark buffer as 'already encoded' (stored as-is) rather than literal buffer (which gets type-tagged)
await db.committed; // Wait for all currently pending writes to be committed to the database (in memory)
await db.flushed; // Wait for all currently pending writes to be flushed to disk
await db.prefetch(keys); // Preload specified keys into memory
await db.backup(path); // Stores an internally consistent copy of the database at the specified path
db.on("beforecommit", () => { /* ... */ }); // Fires just before commit to disk, allows async ops, forces eventTurnBatching on

// Transactions
// A transaction callback is called at some later time when processing database operations, and a single database commit may
//  also contain operations from outside of the transaction, in addition to the transaction. The return value gets passed through.
let returnValue = await db.transaction(() => { /* ... series of DB operations here ... */ });

// Child transactions can be used to support rollbacks; if something throws within one, the changes are rolled back and the
//  transaction will be aborted. Because transaction management is synchronously stateful, no reference to the parent transaction
//  is needed. It's also possible to create a child transaction *outside of* any parent transaction, in which case it will be rolled
//  into a default(?) transaction on the next commit.
let returnValue = await db.childTransaction(() => { /* ... series of DB operations here ... */ });

// Asynchronous transaction callbacks are *possible* but since transaction management is stateful, this can result in unrelated
//  operations unexpectedly ending up in a transaction that they don't belong to. So normally you *shouldn't* await inside of a
//  transaction, even though you would use the asynchronous versions of operations. Instead of checking for failure asynchronously,
//  rely on the transaction abort/rollback.

// There is a synchronous transaction equivalent, which doesn't batch? and executes immediately
let returnValue = db.transactionSync(() => { /* ... series of DB operations here ... */ });

// And there are also read transactions, providing a consistent view to read from
let transaction = db.useReadTransaction();
/* ... read-only operations go here ... */
transaction.done(); // Do not forget! Or it will leak
```

# Fixing node: prefixed requires/imports in Browserify

If you're running into issues with Browserify and prefixed module names (like in `require("node:fs")`), and you cannot change the imports because eg. they exist within a third-party module, then this workaround will fix that.

<p class="callout warning">This *doesn't* work for newer core modules like `node:test`, which are *only* possible to `require` using the prefix; it only works for core modules that allow importing both with and without the prefix.</p>

You can use [browserify-replace](https://www.npmjs.com/package/browserify-replace) to automatically replace every occurrence of the prefix in every module; you need to **ensure that this comes *before* any other transform**, including before things like Babel.

For example:

```javascript
	app.use("/bundle.js", watchifyMiddleware(browserify("src/ui/index.jsx", {
		basedir: __dirname,
		debug: true,
		cache: {},
		extensions: [".jsx"],
		transform: [
			["browserify-replace", {
				global: true,
				replace: [
					{ from: '"node:([a-z_-]+)"', to: '"$1"' },
					{ from: "'node:([a-z_-]+)'", to: "'$1'" },
				]
			}],
			["babelify", {
				presets: ["@babel/preset-env", "@babel/preset-react"],
			}],
		]
	})));
```

The `browserify-replace` transform is what makes it work; the `global` flag is what makes it work *for everything within `node_modules` too*. The rest of the example is not needed to make it work; this code is just taken from a project of mine.

What it does is perform a *string replace* to strip the prefix off everything that it finds. Because it's a plain string replace, it doesn't need to go through any (language-aware) processing steps that might stumble over the unsupported import path, so this should work for eg. JSX and Typescript as well.

It's still a hack and it'll match *any* string that has this format, so I wouldn't recommend relying on this in the long term, and I'd suggest looking into a better-maintained bundler like Parcel nowadays. But to keep things running in the short term, this should suffice.

# NixOS

# Setting up Bookstack

Turned out to be pretty simple.

```
deployment.secrets.bookstack-app-key = {
	source = "../private/bookstack/app-key";
	destination = "/var/lib/bookstack/app-key";
	owner = { user = "bookstack"; group = "bookstack"; };
	permissions = "0700";
};

services.bookstack = {
	enable = true;
	hostname = "wiki.slightly.tech";
	maxUploadSize = "10G";
	appKeyFile = "/var/lib/bookstack/app-key";
	nginx = { enableACME = true; forceSSL = true; };
	database = { createLocally = true; };
};
```

Server was running an old version of NixOS, 23.05, where MySQL doesn't work in a VPS (anymore). Upgraded the whole thing to 24.11 and then it Just Worked.

Afterwards, run:

```bash
bookstack bookstack:create-admin
```

... in a terminal on the server to set up the primary administrator account. Done.

# A *complete* listing of operators in Nix, and their precedence

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/c3c047f3406aea9ec65eebce2ffd449d](https://gist.github.com/joepie91/c3c047f3406aea9ec65eebce2ffd449d).</p>

<p class="callout warning">The information in this article has since been absorbed into the official Nix manual. It is kept here for posterity. It may be outdated by the time you read this.</p>

Lower precedence means a stronger binding; ie. this list is sorted from strongest to weakest binding, and in the case of equal precedence between two operators, the associativity decides the binding.

<table id="bkmrk-prec-abbreviation-ex" style="width: 100%;"><thead><tr><th style="width: 5.95948%;">Prec</th><th style="width: 12.1573%;">Abbreviation</th><th style="width: 21.0956%;">Example</th><th style="width: 6.55642%;">Assoc</th><th style="width: 54.2312%;">Description</th></tr></thead><tbody><tr><td style="width: 5.95948%;">1</td><td style="width: 12.1573%;">SELECT</td><td style="width: 21.0956%;">`e . attrpath [or def]`</td><td style="width: 6.55642%;">none</td><td style="width: 54.2312%;">Select attribute denoted by the attribute path `attrpath` from set `e`. (An attribute path is a dot-separated list of attribute names.) If the attribute doesn’t exist, return `default` if provided, otherwise abort evaluation.</td></tr><tr><td style="width: 5.95948%;">2</td><td style="width: 12.1573%;">APP</td><td style="width: 21.0956%;">`e1 e2`</td><td style="width: 6.55642%;">left</td><td style="width: 54.2312%;">Call function `e1` with argument `e2`.</td></tr><tr><td style="width: 5.95948%;">3</td><td style="width: 12.1573%;">NEG</td><td style="width: 21.0956%;">`-e`</td><td style="width: 6.55642%;">none</td><td style="width: 54.2312%;">Numeric negation.</td></tr><tr><td style="width: 5.95948%;">4</td><td style="width: 12.1573%;">HAS\_ATTR</td><td style="width: 21.0956%;">`e ? 
attrpath`</td><td style="width: 6.55642%;">none</td><td style="width: 54.2312%;">Test whether set `e` contains the attribute denoted by `attrpath`; return true or false.</td></tr><tr><td style="width: 5.95948%;">5</td><td style="width: 12.1573%;">CONCAT</td><td style="width: 21.0956%;">`e1 ++ e2`</td><td style="width: 6.55642%;">right</td><td style="width: 54.2312%;">List concatenation.</td></tr><tr><td style="width: 5.95948%;">6</td><td style="width: 12.1573%;">MUL</td><td style="width: 21.0956%;">`e1 * e2`</td><td style="width: 6.55642%;">left</td><td style="width: 54.2312%;">Numeric multiplication.</td></tr><tr><td style="width: 5.95948%;">6</td><td style="width: 12.1573%;">DIV</td><td style="width: 21.0956%;">`e1 / e2`</td><td style="width: 6.55642%;">left</td><td style="width: 54.2312%;">Numeric division.</td></tr><tr><td style="width: 5.95948%;">7</td><td style="width: 12.1573%;">ADD</td><td style="width: 21.0956%;">`e1 + e2`</td><td style="width: 6.55642%;">left</td><td style="width: 54.2312%;">Numeric addition, or string concatenation.</td></tr><tr><td style="width: 5.95948%;">7</td><td style="width: 12.1573%;">SUB</td><td style="width: 21.0956%;">`e1 - e2`</td><td style="width: 6.55642%;">left</td><td style="width: 54.2312%;">Numeric subtraction.</td></tr><tr><td style="width: 5.95948%;">8</td><td style="width: 12.1573%;">NOT</td><td style="width: 21.0956%;">`!e`</td><td style="width: 6.55642%;">left</td><td style="width: 54.2312%;">Boolean negation.</td></tr><tr><td style="width: 5.95948%;">9</td><td style="width: 12.1573%;">UPDATE</td><td style="width: 21.0956%;">`e1 // e2`</td><td style="width: 6.55642%;">right</td><td style="width: 54.2312%;">Return a set consisting of the attributes in `e1` and `e2` (with the latter taking precedence over the former in case of equally named attributes).</td></tr><tr><td style="width: 5.95948%;">10</td><td style="width: 12.1573%;">LT</td><td style="width: 21.0956%;">`e1 < e2`</td><td style="width: 
6.55642%;">left</td><td style="width: 54.2312%;">Less than.</td></tr><tr><td style="width: 5.95948%;">10</td><td style="width: 12.1573%;">LTE</td><td style="width: 21.0956%;">`e1 <= e2`</td><td style="width: 6.55642%;">left</td><td style="width: 54.2312%;">Less than or equal.</td></tr><tr><td style="width: 5.95948%;">10</td><td style="width: 12.1573%;">GT</td><td style="width: 21.0956%;">`e1 > e2`</td><td style="width: 6.55642%;">left</td><td style="width: 54.2312%;">Greater than.</td></tr><tr><td style="width: 5.95948%;">10</td><td style="width: 12.1573%;">GTE</td><td style="width: 21.0956%;">`e1 >= e2`</td><td style="width: 6.55642%;">left</td><td style="width: 54.2312%;">Greater than or equal.</td></tr><tr><td style="width: 5.95948%;">11</td><td style="width: 12.1573%;">EQ</td><td style="width: 21.0956%;">`e1 == e2`</td><td style="width: 6.55642%;">none</td><td style="width: 54.2312%;">Equality.</td></tr><tr><td style="width: 5.95948%;">11</td><td style="width: 12.1573%;">NEQ</td><td style="width: 21.0956%;">`e1 != e2`</td><td style="width: 6.55642%;">none</td><td style="width: 54.2312%;">Inequality.</td></tr><tr><td style="width: 5.95948%;">12</td><td style="width: 12.1573%;">AND</td><td style="width: 21.0956%;">`e1 && e2`</td><td style="width: 6.55642%;">left</td><td style="width: 54.2312%;">Logical AND.</td></tr><tr><td style="width: 5.95948%;">13</td><td style="width: 12.1573%;">OR</td><td style="width: 21.0956%;">`e1 || e2`</td><td style="width: 6.55642%;">left</td><td style="width: 54.2312%;">Logical OR.</td></tr><tr><td style="width: 5.95948%;">14</td><td style="width: 12.1573%;">IMPL</td><td style="width: 21.0956%;">`e1 -> e2`</td><td style="width: 6.55642%;">none</td><td style="width: 54.2312%;">Logical implication (equivalent to `!e1 || e2`).</td></tr></tbody></table>
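A couple of hypothetical expressions showing how these rules play out in practice:

```nix
# `//` (precedence 9) binds more strongly than `==` (precedence 11), so this
# is parsed as ({ a = 1; } // { b = 2; }) == { a = 1; b = 2; }, and is true:
{ a = 1; } // { b = 2; } == { a = 1; b = 2; }

# `++` (precedence 5) is right-associative, so this is parsed as
# [ 1 ] ++ ([ 2 ] ++ [ 3 ]), evaluating to [ 1 2 3 ]:
[ 1 ] ++ [ 2 ] ++ [ 3 ]
```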

# Setting up Hydra

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/c26f01a787af87a96f967219234a8723](https://gist.github.com/joepie91/c26f01a787af87a96f967219234a8723) in 2017. The NixOS ecosystem constantly changes, and it may not be relevant anymore by the time you read this article.</p>

Just some notes from my attempt at setting up Hydra.

### Setting up on NixOS

No need for manual database creation and all that; just ensure that your PostgreSQL service is running (`services.postgresql.enable = true;`), and then enable the Hydra service (`services.hydra.enable`). The Hydra service will need a few more options to be set up, below is my configuration for it:

```
    services.hydra = {
        enable = true;
        port = 3333;
        hydraURL = "http://localhost:3333/";
        notificationSender = "hydra@cryto.net";
        useSubstitutes = true;
        minimumDiskFree = 20;
        minimumDiskFreeEvaluator = 20;
    };

```

Database and user creation and all that will happen automatically. You'll only need to run `hydra-init` and then `hydra-create-user` to create the first user. Note that you *may* need to run these scripts as root if you get permission or filesystem errors.

### Can't run `hydra-*` utility scripts / access the web interface due to database errors

If you already have a `services.postgresql.authentication` configuration line from elsewhere (either another service, or your own `configuration.nix`), it may be conflicting with the one specified in the Hydra service. There's an open issue about it [here](https://github.com/NixOS/nixpkgs/issues/32063).

### Can't login

After running `hydra-create-user` in your shell, you may be running into the following error in the web interface: "Bad username or password."

When this occurs, it's likely because the `hydra-*` utility scripts stored your data in a local SQLite database, rather than the PostgreSQL database you configured. As far as I can tell, this happens because of some missing `HYDRA_*` environment variables that are set through `/etc/profile`, which is only applied on your next login. Simply opening a new shell is not enough.

As a workaround until your next login/boot, you can run the following to obtain the command you need to run to apply the new environment variables in your current shell:

```bash
grep set-environment /etc/profile
```

... and then run the resulting command (including the dot at the start, if there is one!) in the shell you intend to run the `hydra-*` scripts in. If you intend to run them as root, make sure you run the `set-environment` script *in the root shell* - using `sudo` will make the environment variables get lost, so you'll be stuck with the same issue as before.

# Fixing root filesystem errors with fsck on NixOS

If you run into an error like this:

```
An error occurred in stage 1 of the boot process, which must mount the root filesystem on `/mnt-root` and then start stage 2. Press one of the following keys:

r) to reboot immediately
*) to ignore the error and continue
```

Then you can fix it like this:

1. Boot into a live CD/DVD for NixOS, or some other environment that has `fsck` installed, but *not* your installed copy of NixOS (as that will mount the root filesystem) [(source)](https://discourse.nixos.org/t/how-to-fix-a-slightly-broken-root-filesystem-on-nixos-vm-image/19972/4?u=joepie91)
2. Run `fsck -yf /dev/sda1` where you replace `/dev/sda1` with your root filesystem. [(source)](https://askubuntu.com/questions/885062/root-file-system-requires-manual-fsck)
    - If you're on a (KVM) VPS, it'll probably be `/dev/vda1`. If you're using LVM (even on a VPS), then you need to specify your logical volume instead (eg. `/dev/vg_main/lv_root`, but it depends on what you've named it).

<p class="callout warning">The above command will automatically agree to whatever suggestion fsck makes. **This can technically lead to data loss!**</p>

Many distributions will give you an option to drop down into a shell from the error directly, but NixOS does not do that. In theory you could add the `boot.shell_on_fail` flag to the boot options for your existing installation, but for reasons that I didn't bother debugging any further, the installed `fsck` was unable to fix the issues.

# Stepping through builder steps in your custom packages

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/b0041188c043259e6e1059d026eff301](https://gist.github.com/joepie91/b0041188c043259e6e1059d026eff301). </p>

1. Create a temporary building folder in your repository (or elsewhere) and enter it: `mkdir test && cd test`
2. `nix-shell ../main.nix -A packagename` (assuming the entry point for your custom repository is `main.nix` in the parent directory)
3. Run the phases individually by entering their name (for a default phase) or doing something like `eval "$buildPhase"` (for an overridden phase) in the Nix shell - a summary of the common ones: `unpackPhase`, `patchPhase`, `configurePhase`, `buildPhase`, `checkPhase`, `installPhase`, `fixupPhase`, `distPhase`
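Put together, a session might look like this (`main.nix` and `packagename` are placeholders for your own repository):

```
$ mkdir test && cd test
$ nix-shell ../main.nix -A packagename
[nix-shell]$ unpackPhase        # default phase: run it by name
[nix-shell]$ eval "$buildPhase" # overridden phase: evaluate the variable
```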

More information about these phases can be found [here](https://nixos.org/releases/nixpkgs/nixpkgs-0.12/manual/#ssec-stdenv-phases). If you use a different builder, you may have a different set of phases.

Don't forget to clear out your `test` folder after every attempt!

# Using dependencies in your build phases

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/b0041188c043259e6e1059d026eff301](https://gist.github.com/joepie91/b0041188c043259e6e1059d026eff301).</p>

You can just use string interpolation to add a dependency path to your script. For example:

```
{
  # ...
  preBuildPhase = ''
    ${grunt-cli}/bin/grunt prepare
  '';
  # ...
}
```

# Source roots that need to be renamed before they can be used

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/b0041188c043259e6e1059d026eff301](https://gist.github.com/joepie91/b0041188c043259e6e1059d026eff301). </p>

Some applications ([such as Brackets](https://github.com/adobe/brackets-shell/wiki/Building-brackets-shell#general-prerequisites)) are very picky about the directory name(s) of your unpacked source(s). In this case, you might need to rename one or more source roots *before* `cd`ing into them.

To accomplish this, do something like the following:

```
{
  # ...
  sourceRoot = ".";
  
  postUnpack = ''
    mv brackets-release-${version} brackets
    mv brackets-shell-${shellBranch} brackets-shell
    cd brackets-shell;
  '';
  # ...
}
```

This keeps Nix from trying to move into the source directories immediately, by explicitly pointing it at the current (ie. top-most) directory of the environment.

# Error: `error: cannot coerce a function to a string`

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/b0041188c043259e6e1059d026eff301](https://gist.github.com/joepie91/b0041188c043259e6e1059d026eff301). </p>

Probably caused by a syntax ambiguity when invoking functions within a list. For example, the following will throw this error:

```
{
  # ...
  srcs = [
    fetchurl {
      url = "https://github.com/adobe/brackets-shell/archive/${shellBranch}.tar.gz";
      sha256 = shellHash;
    }
    fetchurl {
      url = "https://github.com/adobe/brackets/archive/release-${version}.tar.gz";
      sha256 = "00yc81p30yamr86pliwd465ag1lnbx8j01h7a0a63i7hsq4vvvvg";
    }
  ];
  # ...
}
```

This can be solved by adding parentheses around the invocations:

```
{
  # ...
  srcs = [
    (fetchurl {
      url = "https://github.com/adobe/brackets-shell/archive/${shellBranch}.tar.gz";
      sha256 = shellHash;
    })
    (fetchurl {
      url = "https://github.com/adobe/brackets/archive/release-${version}.tar.gz";
      sha256 = "00yc81p30yamr86pliwd465ag1lnbx8j01h7a0a63i7hsq4vvvvg";
    })
  ];
  # ...
}
```

# `buildInputs` vs. `nativeBuildInputs`?

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/b0041188c043259e6e1059d026eff301](https://gist.github.com/joepie91/b0041188c043259e6e1059d026eff301). </p>

More can be found [here](https://github.com/NixOS/nixpkgs/issues/4855#issuecomment-61966503).

- **buildInputs:** Dependencies for the (target) system that your built package will eventually run on.
- **nativeBuildInputs:** Dependencies for the system where the build is being created.

The difference only really matters when cross-compiling - when building for your own system, *both* sets of dependencies end up being treated as `nativeBuildInputs`.
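A hypothetical sketch of how this looks in a derivation (assuming `pkg-config` and `openssl` come in via `callPackage` arguments): `pkg-config` must *run* on the build machine, while `openssl` gets linked into the built result:

```
{
  # ...
  nativeBuildInputs = [ pkg-config ];  # tools that run during the build
  buildInputs = [ openssl ];           # libraries needed by the built result
  # ...
}
```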

# QMake ignores my `PREFIX`/`INSTALL_PREFIX`/etc. variables!

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/b0041188c043259e6e1059d026eff301](https://gist.github.com/joepie91/b0041188c043259e6e1059d026eff301). </p>

QMake does not have a standardized configuration variable for installation prefixes - `PREFIX` and `INSTALL_PREFIX` only work if the project files for the software you're building specify it explicitly.

If the project files have a hardcoded path, there's still a workaround to install it in `$out` anyway, *without* source code or project file patches:

```
{
  # ...
  preInstall = "export INSTALL_ROOT=$out";
  # ...
}
```

This `INSTALL_ROOT` environment variable will be picked up and used by `make install`, *regardless* of the paths specified by QMake.

# Useful tools for working with NixOS

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/67316a114a860d4ac6a9480a6e1d9c5c](https://gist.github.com/joepie91/67316a114a860d4ac6a9480a6e1d9c5c). Some links have been removed, as they no longer exist, or are no longer updated.</p>

### Online things

- [Package search](https://nixos.org/nixos/packages.html)
- [Options search](https://nixos.org/nixos/options.html)
- [A list of channels, and when they were last updated](https://status.nixos.org/)

### Development tooling

- [A `.drv` file parser in JS](https://www.npmjs.com/package/drv)
- [rnix](https://gitlab.com/jD91mZM2/rnix), a Nix (language) parser in Rust

### (Reference) documentation

- [Nix manual](https://nixos.org/nix/manual/)
- [A *complete* list of Nix operators](https://wiki.slightly.tech/books/miscellaneous-notes/page/a-complete-listing-of-operators-in-nix-and-their-predence "A *complete* listing of operators in Nix, and their precedence.") (the list in the official manual is incomplete)
- [nixpkgs manual](https://nixos.org/nixpkgs/manual/)
- [NixOS manual](https://nixos.org/nixos/manual/)
- [Official NixOS wiki](https://wiki.nixos.org/)

### Tutorials and examples

- [Step-by-step walkthrough of the Nix language](https://medium.com/@MrJamesFisher/nix-by-example-a0063a1a4c55)
- [A shorter primer of the Nix language](http://www.binaryphile.com/nix/2018/07/22/nix-language-primer.html) (probably a better option if you already know another programming language)
- [Nix pills](https://lethalman.blogspot.nl/2014/07/nix-pill-1-why-you-should-give-it-try.html) (a series of articles about different aspects of Nix; ongoing work on a compact edition can be found [here](https://github.com/deepfire/nix-pills-compact-edition))
- [Example configurations](https://nixos.wiki/wiki/Configuration_Collection)
- [Hardware configurations](https://github.com/NixOS/nixos-hardware) (includes configurations for dealing with quirks on specific hardware and models)

### Community and support

- [The Nix forum](https://discourse.nixos.org/)
- [The Matrix space](https://matrix.to/#/#community:nixos.org)

### Miscellaneous notes and troubleshooting

- My Nix and NixOS notes: see the rest of the articles in this chapter!
- My [Hydra setup notes](https://wiki.slightly.tech/books/miscellaneous-notes/page/setting-up-hydra "Setting up Hydra")

# Proprietary AMD drivers (fglrx) causing fatal error in i387.h

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/ce9267788fdcb37f5941be5a04fcdd0f](https://gist.github.com/joepie91/ce9267788fdcb37f5941be5a04fcdd0f). It should no longer be applicable, but is preserved here in case a similar issue reoccurs in the future.</p>

If you get this error:

```
/tmp/nix-build-ati-drivers-15.7-4.4.18.drv-0/common/lib/modules/fglrx/build_mod/2.6.x/firegl_public.c:194:22: fatal error: asm/i387.h: No such file or directory
```

... it's because the drivers are not compatible with your current kernel version. I've worked around it by adding this to my `configuration.nix`, to switch to a 4.1 kernel:

```
{
  # ...
  boot.kernelPackages = pkgs.linuxPackages_4_1;
  # ...
}
```

# Installing a few packages from `master`

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/ce9267788fdcb37f5941be5a04fcdd0f](https://gist.github.com/joepie91/ce9267788fdcb37f5941be5a04fcdd0f).</p>

<p class="callout warning">You probably want to install from `unstable` instead of `master`, and you probably want to do it differently than described here (eg. importing from URL or specifying it as a Flake). This documentation is kept here for posterity, as it is still helpful to understand how to import a *local* copy of a nixpkgs into your configuration.</p>

1. `git clone https://github.com/NixOS/nixpkgs.git /etc/nixos/nixpkgs-master`
2. Edit your `/etc/nixos/configuration.nix` like this:

```
{ config, pkgs, ... }:

let
  nixpkgsMaster = import ./nixpkgs-master {};
  
  stablePackages = with pkgs; [
    # This is where your packages from stable nixpkgs go
  ];
  
  masterPackages = with nixpkgsMaster; [
    # This is where your packages from `master` go
    nodejs-6_x
  ];
in {
  # This is where your normal config goes, we've just added a `let` block
  
  environment = {
    # ...
    
    systemPackages = stablePackages ++ masterPackages;
  };
  
  # ...
}
```
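For comparison, the "importing from URL" approach mentioned in the warning above might look something like this (untested sketch; the tarball URL is GitHub's standard archive for the `nixos-unstable` branch, and `htop` is just an example package):

```
{ config, pkgs, ... }:

let
  unstablePkgs = import (fetchTarball "https://github.com/NixOS/nixpkgs/archive/nixos-unstable.tar.gz") {};
in {
  # ...
  environment.systemPackages = [ unstablePkgs.htop ];
  # ...
}
```

This avoids keeping a local clone up to date yourself, at the cost of the build fetching (and occasionally re-fetching) the tarball.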

# GRUB2 on UEFI

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/ce9267788fdcb37f5941be5a04fcdd0f](https://gist.github.com/joepie91/ce9267788fdcb37f5941be5a04fcdd0f).</p>

<p class="callout warning">These instructions are most likely outdated. They are kept here for posterity.</p>

This works fine. You need your `boot` section configured like this:

```
{
  # ...
  boot = {
    loader = {
      gummiboot.enable = false;
      
      efi = {
        canTouchEfiVariables = true;
      };
      
      grub = {
        enable = true;
        device = "nodev";
        version = 2;
        efiSupport = true;
      };
    };
  };
  # ...
}
```

# Unblock ports in the firewall on NixOS

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/ce9267788fdcb37f5941be5a04fcdd0f](https://gist.github.com/joepie91/ce9267788fdcb37f5941be5a04fcdd0f). </p>

The firewall is enabled by default. This is how you open a port:

```
{
  # ...
  networking = {
    # ...
    
    firewall = {
      allowedTCPPorts = [ 24800 ];
    };
  };
  # ...
}
```

# Guake doesn't start because of a GConf issue

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/ce9267788fdcb37f5941be5a04fcdd0f](https://gist.github.com/joepie91/ce9267788fdcb37f5941be5a04fcdd0f). It may or may not still be relevant.</p>

<p class="callout warning">*From nixpkgs:* GNOME's GConf implements a system-wide registry (like on Windows) that applications can use to store and retrieve internal configuration data. That concept is inherently impure, and it's very hard to support on NixOS.</p>

1. Follow the instructions [here](https://web.archive.org/web/20160829175357/https://nixos.org/wiki/Solve_GConf_errors_when_running_GNOME_applications).
2. Run the following to set up the GConf schema for Guake: `gconftool-2 --install-schema-file $(readlink $(which guake) | grep -Eo '\/nix\/store\/[^\/]+\/')"share/gconf/schemas/guake.schemas"`. This will not work if you have changed your Nix store path - in that case, modify the command accordingly.

You may need to re-login to make the changes apply.

# FFMpeg support in youtube-dl

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/ce9267788fdcb37f5941be5a04fcdd0f](https://gist.github.com/joepie91/ce9267788fdcb37f5941be5a04fcdd0f). It may no longer be necessary.</p>

Based on [this post](https://github.com/NixOS/nixpkgs/issues/5236#issuecomment-139865161):

```
{
  # ...
  stablePackages = with pkgs; [
    # ...
    (python35Packages.youtube-dl.override {
      ffmpeg = ffmpeg-full;
    })
    # ...
  ];
  # ...
}
```

(To understand what `stablePackages` is here, see [this entry](https://wiki.slightly.tech/books/miscellaneous-notes/page/installing-a-few-packages-from-master "Installing a few packages from `master`").)

# An incomplete rant about the state of the documentation for NixOS

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/5232c8f1e75a8f54367e5dfcfd573726](https://gist.github.com/joepie91/5232c8f1e75a8f54367e5dfcfd573726). </p>

<p class="callout warning">**Historical note:** I wrote this rant in 2017, originally intended to be posted on the NixOS forums. This never ended up happening, as discussing the (then private) draft already started driving changes to the documentation approach. The documentation has improved since this was written, however some issues remain to this day at the time of writing this remark, in 2024. The rant ends abruptly, because I never ended up finishing it - but it still contains a lot of useful points regarding documentation quality, and so I am preserving it here.</p>

I've now been using NixOS on my main system for a few months, and while I appreciate the technical benefits a lot, I'm constantly running into walls concerning documentation and general problem-solving. After discussing this briefly on IRC in the past, I've decided to post a rant / essay / whatever-you-want-to-call-it here.

### An upfront note

My frustration about these issues has built up considerably over the past few months, moreso because I know that *from a technical perspective* it all makes a lot of sense, and there's a lot of potential behind NixOS. However, I've found it pretty much impenetrable on a getting-stuff-done level, because the documentation on many things is either poor or non-existent.

While my goal here is to *get things fixed* rather than just complaining about them, that frustration might occasionally shine through, and so I might come across as a bit harsh. This is not my intention, and there's no ill will towards any of the maintainers or users. I just want to address the issues head-on, and get them fixed effectively.

To address any *"just send in a PR"* comments ahead of time: while I do know how to write good documentation (and I do so [on a regular basis](https://gist.github.com/joepie91/95ed77b71790442b7e61)), I still don't understand much of how NixOS and nixpkgs are structured, exactly *because* the documentation is so poorly accessible. I couldn't fix the documentation myself if I wanted to, simply because I don't have the understanding required to do so, and I'm finding it very hard to obtain that understanding.

One last remark: throughout the rant, I'll be posing a number of questions. These are not necessarily all questions that I *still have*, as I've found the answer to several of them after hours of research - they just serve to illustrate the interpretation of the documentation from the point of view of a beginner, so there's no need to try and answer them in this thread. These are just the type of questions that should be anticipated and answered in the documentation.

### Types of documentation

Roughly speaking, there are three types of documentation for anything programming-related:

1. Reference documentation
2. Conceptual documentation
3. Tutorials

In the sections below, "tooling" will refer to any kind of to-be-documented thing - a function, an API call, a command-line tool, and so on.

#### Reference documentation

Reference documentation is intended for readers who are *already familiar* with the tooling that is being documented. It typically follows a rigorous format, and defines things such as function names, arguments, return values, error conditions, and so on. Reference documentation is generally considered the "single source of truth" - whatever behaviour is specified there, is what the tooling should actually do.

Some examples of reference documentation:

- [https://nodejs.org/api/querystring.html](https://nodejs.org/api/querystring.html)
- [https://doc.rust-lang.org/std/](https://doc.rust-lang.org/std/)

Reference documentation generally assumes all of the following:

- The reader understands the purpose of the tooling
- The reader understands the concepts that the tooling uses or implements
- The reader understands the relation of the tooling to other tooling

#### Conceptual documentation

Conceptual documentation is intended for readers who do not yet understand the tooling, but are already familiar with the environment (language, shell, etc.) in which it's used.

Some examples of conceptual documentation:

- [http://cryto.net/~joepie91/blog/2016/05/11/what-is-promise-try-and-why-does-it-matter/](http://cryto.net/~joepie91/blog/2016/05/11/what-is-promise-try-and-why-does-it-matter/)
- [https://hughfdjackson.com/javascript/prototypes-the-short(est-possible)-story/](https://hughfdjackson.com/javascript/prototypes-the-short(est-possible)-story/)
- [https://doc.rust-lang.org/stable/book/the-stack-and-the-heap.html](https://doc.rust-lang.org/stable/book/the-stack-and-the-heap.html)

*Good* conceptual documentation doesn't make any assumptions about the background of the reader or what other tooling they might already know about, and explicitly indicates any prior knowledge that's required to understand the documentation - preferably including a link to documentation about those "dependency topics".

#### Tutorials

Tutorials can be intended for two different groups of readers:

1. Readers who don't yet understand the environment (eg. "Introduction to Bash syntax")
2. Readers who don't *want* to understand the environment (eg. "How to build a full-stack web application")

While I would consider tutorials pandering to the second category actively harmful, they're a thing that exists nevertheless.

Some examples of tutorials:

- [https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide)
- [https://zellwk.com/blog/crud-express-mongodb/](https://zellwk.com/blog/crud-express-mongodb/)
- [http://www.freewebmasterhelp.com/tutorials/phpmysql](http://www.freewebmasterhelp.com/tutorials/phpmysql)

Tutorials don't make *any* assumptions about the background of the reader... but they have to be read from start to end. Starting in the middle of a tutorial is not likely to be useful, as tutorials are more designed to "hand-hold" the reader through the process (without necessarily understanding *why* things work how they work).

### The current state of the Nix(OS) documentation

Unfortunately, the NixOS documentation is currently lacking in all three areas.

The official Nix, NixOS and nixpkgs manuals attempt to be all three types of documentation - tutorials (like [this one](https://nixos.org/nixos/manual/index.html#ch-installation)), conceptual documentation (like [this](https://nixos.org/nix/manual/#sec-profiles)), and reference documentation (like [this](https://nixos.org/nix/manual/#part-command-ref)). The wiki *sort of* tries to be conceptual documentation (like [here](https://nixos.org/wiki/Anatomy_of_Nix_Package_Management)), and does so a little better than the manual, but... the wiki is being shut down, and it's still far from complete.

The most lacking aspect of the NixOS documentation is currently the conceptual documentation. What is a "derivation"? Why does it exist? How does it relate to what I, as a user, want to do? How is the Nix store structured, and what guarantees does this give me? What is the difference between `/etc/nixos/configuration.nix` and `~/.nixpkgs/config.nix`, and can they be used interchangeably? Is `nixpkgs` just a set of packages, or does it also include tooling? Which tooling is provided by Nix the package manager, which is provided by NixOS, and which is provided by nixpkgs? Is this different on non-NixOS, and why?

Most of the official documentation - including the wiki - is structured more like a very extensive tutorial. You're told, step by step, *what* to do... but not why any of it matters, what it's for, or how to use these techniques in different situations. [This wiki section](https://nixos.org/wiki/Nix_Modifying_Packages#Overriding_Existing_Packages) is a good example. What does `overrideDerivation` actually *do*? What's the difference with `override`? What's the difference between 'attributes' and 'arguments'? Why is there a random link about the Oracle JDK there? Is the `src` *completely* overridden, or just the attributes that are specified there? What if I want to reevaluate all the other attributes based on the changes that I've made - for example, regenerating the `name` attribute based on a changed `version` attribute? Are any of these tools useful in *other* scenarios that aren't directly addressed here?

The ["Nix pills"](https://lethalman.blogspot.com/2014/07/nix-pill-1-why-you-should-give-it-try.html) sort of try to address this lack of conceptual information, and are quite informational, but they have their problems too. They are not clearly structured (where's the index of all the articles?), the text formatting can be hard to read, and it is still half of a tutorial - it can be hard to understand later pills without having read earlier ones, because they're not fully self-contained. On top of that, they're third-party documentation and not part of the official documentation.

The official manuals have a number of formatting/structural issues as well. The single-page format is frankly *horrible* for navigating through - finding anything on the page is difficult, and following links to other things gets messy fast. Because it's all a single page, every tab has the exact same title, it's easy to scroll past the section you were reading, and so on. Half the point of the web is to have *hyperlinked content across multiple documents*, but the manuals completely forgo that and create a really poor user experience. It's awful for search engines too, because no matter what you search for, you always end up *on the exact same page*.

Another problem is the fact that I have to say "manuals" - there are *multiple manuals*, and the distinction between them is not at all clear. Because it's unclear what functionality is provided by what part of the stack, it usually becomes a hunt of going through all three manuals ctrl+F'ing for some keywords, and hoping that you will run into the thing you're looking for. Then once you (hopefully) do, you have to be careful not to accidentally scroll away from it and lose your reference. There's really no good reason for this separation; it just makes it harder to cross-reference between different parts of the stack, and most users will be using all of them anyway.

The manual, as it is, is not a viable format. While I understand that the wiki had issues with outdated information, it's still a far better *structure* than a set of single-page manuals. I'll go into more detail at the end of this rant, but my proposed solution here would be to follow a wiki-like format for the official documentation.

### Missing documentation

Aside from the issues with the documentation *format*, there are also plenty of issues with its *content*. Many things are fully undocumented, especially where `nixpkgs` is concerned. For example, nothing says that I should be using `callPackage_i686` to package something with 32-bits dependencies. Or how to package something that requires the user to manually add a source file from their filesystem using `nix-prefetch-url`, or using `nix-store --add-fixed`. And what's the difference between those two anyway? And why is there a separate `qt5.callPackage`, and when do I need it?

There are a *ton* of situations where you need oddball solutions to get something packaged. In fact, I would argue that this is the *majority* of cases - most of the easy pickings have been packaged by now, and the tricky ones are left. But as a new user that just wants to get an application working, I end up spending *several hours* on each of the above questions, and I'm still not convinced that I have the right answer. Had somebody taken 10 minutes to document this, even if just as a rough note, it would have saved me *hours* of work.

### No clear path to solutions

When faced with a given packaging problem, it's not at all obvious how to get to the solution. There's no obvious process for fixing or debugging issues, and error messages are often cryptic or poorly formatted. What does "cannot coerce a set to a string" mean, and why is it happening? How can I duct-tape-debug something by adding a `print` statement of some variety? Is there an interactive debugger of some sort?

It's very difficult to learn enough about NixOS internals to figure out what the *right* way is to package any given thing, and because there's no good feedback on what's *wrong* either, it's too hard to get anything packaged that isn't a standard autotools build. There's no "Frequently Asked Questions" or "Common Packaging Problems" section, nor have I found any useful tooling for analyzing packaging problems in more detail. I've had to write some of this tooling myself!

The documentation should anticipate the common problems that new users run into, and give them some hints on where to start looking. It currently completely fails to do so, and assumes that the users will figure out the relation between things themselves.

### Reading code

Because of the above issues, often the only solution is to read the code of existing packages, and try to infer from their expressions how to approach certain problems - but that comes with its own set of problems. There does not appear to be a consistent way of solving packaging problems in NixOS, and almost every package seems to have invented its own way of solving the same problems that other packages have already solved. After several hours of research, it often turns out that half the solutions are either outdated or just wrong. And then I *still* have no idea what the optimal solution is, out of the remaining options.

This is made worse by the serious lack of comments in `nixpkgs`. Barely any packages have comments *at all*, and frequently there are complex multi-level abstractions in place to solve certain problems, but with absolutely no information to explain *why* those abstractions exist. They're not exactly self-evident either. Then there are the packages that *do* have comments, but they're aimed at the *user* rather than the *packager* - one such example is the [Guake package](https://github.com/NixOS/nixpkgs/blob/master/pkgs/applications/misc/guake/default.nix#L1-L11). Essentially, it seems the repository is absolutely full of hacks with no standardized way of solving problems, no doubt helped by the fact that existing solutions simply *aren't documented*.

This is a tremendous waste of time for everybody involved, and makes it very hard to package anything unusual, often to the point of just giving up and hacking around the issue in an impure way. Right now we have what seems like a significant amount of people doing the same work over and over and over again, resulting in different implementations every time. If people took the time to *document* their solutions, this problem would pretty much instantly go away. From a technical point of view, there's absolutely no reason for packaging to be this hard to do.

### Tooling

On top of all this, the tooling seems to change *constantly* - abstractions get deprecated, added, renamed, moved, and so on. Many of the `stdenv` abstractions aren't documented, or their documentation is incomplete. There's no clear way to determine which tooling is still in use, and which tooling has been deprecated.

The tooling that *is* in use - in particular the command-line tooling - is often poorly designed from a usability perspective. Different tools use different flags for the same purpose, and behave differently in different scenarios for no obvious reason. There's a [UX proposal](https://gist.github.com/edolstra/efcadfd240c2d8906348) that seems to fix many of these problems, but it seems to be more or less dead, and its existence is not widely known.

# gparted doesn't let me create a btrfs partition!

If the `btrfs` option in GParted's partition type list is grayed out, add `btrfs-progs` to your `environment.systemPackages` and rebuild your system. After restarting GParted, the option should be available.
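As a minimal sketch of that configuration change (a module fragment; merge it into your existing configuration):

```nix
{ pkgs, ... }: {
	# Makes mkfs.btrfs and friends available system-wide,
	# which GParted needs in order to offer btrfs as an option.
	environment.systemPackages = [ pkgs.btrfs-progs ];
}
```

Don't forget to `nixos-rebuild switch` afterwards.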

# Dealing with Bookstack breakage in a NixOS 25.11 upgrade

In 25.11, the NixOS service module for Bookstack unfortunately saw a number of breaking changes that were never added to the release notes.

Here's what you need to fix to get it to work again:

- You need to add a `base64:` prefix to the key in your app key file. The old key generation stored the key without this prefix, and inserted it at runtime. This runtime fix was removed in a recent refactor of the service module, so old key files no longer work in 25.11. To fix it, prepend `base64:` to the contents of your `APP_KEY_FILE` yourself. Don't forget to deploy your secrets again if you're using a deployment tool, and restart `phpfpm-bookstack` for the change to take effect. See also [this thread](https://github.com/BookStackApp/BookStack/issues/5289).
- You need to configure MySQL (or more likely, MariaDB) yourself, the module no longer does this for you. This may not be necessary if you're upgrading on the same machine, but it'll probably bite you if you ever move to a different machine, so I'd recommend just adding it to your configuration anyway. See the example below.
- You need to specify most of the application settings yourself, as most of the configuration defaults have disappeared. See the example below.

```nix
		services.bookstack = {
			# ...
			# database = { createLocally = true; }; # No longer works in NixOS 25.11
			settings = {
				APP_KEY_FILE = "/var/lib/bookstack/app-key";
				DB_HOST = "localhost";
				DB_PORT = 3306;
				DB_DATABASE = "bookstack";
				DB_USERNAME = "bookstack";
				SESSION_SECURE_COOKIE = true; # Only set to true when you're serving over TLS (which I hope you are!)
			};
			# ...
		};

		# For Bookstack
		services.mysql = let
			bsConfig = config.services.bookstack.settings;
		in {
			enable = true;
			package = pkgs.mariadb;
			ensureDatabases = [ bsConfig.DB_DATABASE ];
			ensureUsers = [{
				name = bsConfig.DB_USERNAME;
				ensurePermissions = {
					"${bsConfig.DB_DATABASE}.*" = "ALL PRIVILEGES";
				};
			}];
		};
```

The above example **does not include** the mail settings, which have *also* been removed from the defaults. Have a look at [the last pre-breakage version of the module](https://github.com/NixOS/nixpkgs/blob/374e6bcc403e02a35e07b650463c01a52b13a7c8/nixos/modules/services/web-apps/bookstack.nix#L315) if you need to replicate these defaults.
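The key-file fix from the first bullet above can be sketched as a small shell snippet. This demonstrates it on a temporary file; point `KEY_FILE` at your real `APP_KEY_FILE` instead (the key value here is made up for the demo):

```shell
# Demonstration on a temporary file; use your real APP_KEY_FILE path instead.
KEY_FILE="$(mktemp)"
printf '%s' 'dGVzdC1rZXk=' > "$KEY_FILE"

# Prepend "base64:" only if it is not already there (idempotent).
grep -q '^base64:' "$KEY_FILE" || sed -i '1s/^/base64:/' "$KEY_FILE"
cat "$KEY_FILE"
```

Afterwards, re-deploy your secrets if needed and restart `phpfpm-bookstack` so the change takes effect.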

# Rust

# Futures and Tokio

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/bc2d29fab43b63d16f59e1bd20fd7b6e](https://gist.github.com/joepie91/bc2d29fab43b63d16f59e1bd20fd7b6e). It may be out of date.</p>

### Event loops

If you're not familiar with the concept of an 'event loop' yet, watch [this video](https://www.youtube.com/watch?v=8aGhZQkoFbQ) first. While this video is about the event loop in JavaScript, most of the concepts apply to event loops in general, and watching it will help you understand Tokio and Futures better as well.

### Concepts

- **Futures:** Think of a [`Future`](https://tokio.rs/docs/getting-started/futures/) like an asynchronous `Result`; it represents some sort of result (a value or an error) that will eventually exist, but doesn't yet. It has many of the same combinators as a `Result` does, the difference being that they are executed at a *later* point in time, not immediately. Aside from representing a future result, a `Future` also contains the logic that is necessary to obtain it. A `Future` will 'complete' (either successfully or with an error) precisely once.
- **Streams:** Think of a [`Stream`](https://tokio.rs/docs/getting-started/streams-and-sinks/#streams) like an asynchronous `Iterator`; like a `Future`, it represents some sort of data that will be obtained at a later point in time, but *unlike* a `Future` it can produce *multiple* results over time. It has many of the same combinators as an `Iterator` does. Essentially, a `Stream` is the "multiple results over time instead of one" counterpart to a `Future`.
- **Executor:** An `Executor` is a thing that, when you pass a `Future` or `Stream` into it, is responsible for 1) turning the logic stored in the `Future`/`Stream` into an internal task, 2) scheduling the work for that task, and 3) wiring up the Future's state to any underlying resources. You don't usually implement these yourself, but use a pre-made `Executor` from some third-party library. The exact scheduling is left up to the implementation of the `Executor`.
- **`.wait()`:** A method on a `Future` that will block the current thread until the `Future` has completed, and that then returns the result of that `Future`. This is an example of an `Executor`, although not a particularly useful one; it won't allow you to do work concurrently.
- **Tokio reactor core:** This is also an `Executor`, provided by the [Tokio](https://tokio.rs/) library. It's probably what you'll be using when you use Tokio. The frontpage of the Tokio website provides an example on how to use it.
- **[`futures_cpupool`](https://docs.rs/futures-cpupool/0.1.6/futures_cpupool/):** Yet another `Executor`; this one schedules the work across a pool of threads.

# Databases and data management

# Database characteristics

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/f9df0b96c600b4fb3946e68a3a3344af](https://gist.github.com/joepie91/f9df0b96c600b4fb3946e68a3a3344af).</p>

<p class="callout warning">**NOTE:** This is simplified. However, it's a useful high-level model for determining what kind of database you need for your project.</p>

### Data models

- **Documents:** Single type of thing, no relations
- **Relational:** Multiple types of things, relations between different types of things
- **Graph:** Single or multiple types of things, relations between different things of the *same* type

### Consistency models

- **Strong consistency:** There is a single canonical view of the database, and everything connecting to any node in the database cluster is guaranteed to see the same data at the same moment in time.
- **Eventual consistency:** There can be multiple different views of the database (eg. different nodes in the cluster may have a different idea of what the current state of the data is), but once you stop changing stuff, they will eventually converge into a single view.
- **No consistency:** There's no guarantee that all nodes in the cluster will ever converge to the same view, whatsoever.

### Schemafulness

- **Schemaful:** You know the format (fields, types, etc.) of the data upfront. Fields may be optional, but every field you use is defined in the schema upfront.
- **Schemaless:** You have no idea what the format is going to be. This is rarely applicable, and only really applies when dealing with storing data from a source that doesn't produce data in a reliable format.

# New Page

# Maths and computer science

Articles and notes that are more about the conceptual side of maths and computer science, rather than anything specific to a particular programming language.

# Prefix codes (explained simply)

<div class="Box-body readme blob p-5 p-xl-6 gist-border-0" id="bkmrk-a-%22prefix-code%22-is-a"><article class="markdown-body entry-content container-lg"><p class="callout info">This article was originally published at [https://gist.github.com/joepie91/26579e2f73ad903144dd5d75e2f03d83](https://gist.github.com/joepie91/26579e2f73ad903144dd5d75e2f03d83).</p>

A "prefix code" is a type of encoding mechanism ("code"). For something to be a prefix code, the entire set of possible encoded values ("codewords") must not contain *any* values that start with any *other* value in the set.

For example: `[3, 11, 22]` is a prefix code, because none of the values start with ("have a prefix of") any of the other values. However, `[1, 12, 33]` is *not* a prefix code, because one of the values (12) starts with another of the values (1).

Prefix codes are useful because, if you have a complete and accurate sequence of values, you can pick out each value without needing to know where one value starts and ends.

For example, let's say we have the following codewords: `[1, 2, 33, 34, 50, 61]`. And let's say that the sequence of numbers we've received looks like this:

> 1611333425012

We can simply start from the left, until we have the first value:

> **1** 611333425012

It couldn't have been any value other than `1`, because by definition of what a prefix code is, if we have a `1` codeword, none of the other codewords can start with a `1`.

Next, we just do the same thing again, with the numbers that are left:

> 1 **61** 1333425012

Again, it could *only* have been `61` - because in a prefix code, none of the other codewords would have been allowed to start with `61`.

Let's try it again for the next number:

> 1 61 **1** 333425012

Same story, it could only have been a `1`. And again:

> 1 61 1 **33** 3425012

Remember, our set of possible codewords is `[1, 2, 33, 34, 50, 61]`.

In this case, it could only have been a `33`, because again, nothing else in the set of codewords was allowed to start with `33`. It couldn't have been `34` either - even though it *also* starts with a `3` (like `33` does), the lack of a `4` as the second digit excludes it as an option.

You can simply keep repeating this until there are no numbers left:

> 1 61 1 33 34 2 50 1 2

... and now we've 'decoded' the sequence of numbers, even though the sequence didn't contain any information on where one number starts and the next number ends.

Note how the fact that both `33` and `34` start with a `3` didn't matter; shared prefixes are fine, so long as one value isn't *in its entirety* used as a prefix of another value. So while `[33, 34]` is fine (it only shares the `3`, neither of the numbers in its entirety is a prefix of the other), `[33, 334]` would *not* be fine, since `33` is a prefix of `334` in its entirety (`33` followed by `4`).

This only works if you can be certain that you got the *entire* message accurately, though; for example, consider the following sequence of numbers:

> 11333425012

(Note how this is just the last part of 16 **11333425012** )

Now, let's look at the first number - it starts with a `1`. However, we don't know what came before, so is it part of a `61`, or is it just a single, independent `1`? There's no way to know for sure, so we can't split up this message.

It doesn't work if you violate the "can't start with another value" rule, either; for example, let's say that our codewords are `[1, 3, 12, 23]`, and we want to decode the following sequence:

> 12323

Let's start with the first number. It starts with a `1`, so it could be either `1` or `12`. We have no way to know! In this particular example we can't figure it out from the numbers after it, either, as there are two different ways to decode this sequence:

> 1 23 23

> 12 3 23

And that's why a prefix code is useful, if you want to distinguish values in a sequence that doesn't have explicit 'markers' of where a value starts and ends.
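The greedy decoding procedure walked through above can be sketched in a few lines of Python (codewords as strings, since we're matching digit sequences):

```python
def decode(sequence, codewords):
    """Greedily split `sequence` into codewords of a prefix code.

    This works because no codeword is a prefix of another, so at each
    position at most one codeword can match.
    """
    result = []
    pos = 0
    while pos < len(sequence):
        for word in codewords:
            if sequence.startswith(word, pos):
                result.append(word)
                pos += len(word)
                break
        else:
            # No codeword matches; the message is incomplete or corrupt.
            raise ValueError(f"no codeword matches at position {pos}")
    return result

print(decode("1611333425012", ["1", "2", "33", "34", "50", "61"]))
# -> ['1', '61', '1', '33', '34', '2', '50', '1', '2']
```

Note that this sketch assumes the input is complete and valid; as described above, a truncated message (or a non-prefix code) cannot be decoded unambiguously.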

</article></div>

# Hardware

Associated notes about hardware hacking, maintenance, etc.

# Cleaning sticky soda spills in a mechanical keyboard without disassembly

<p class="callout danger">Follow these instructions at your own risk. This is an experimental approach that may or may not cause long-term damage to your keyboard.</p>

<p class="callout info">This approach was only tested using Kailh Choc switches on a Glove80. It may not work with other switch designs.</p>

A Glove80 is a pain to disassemble for cleaning, so I've figured out a way to deal with sticky spills without doing so. I am not certain that it is entirely safe, but so far (several weeks after the spill) I have not noticed any deterioration of functionality, despite having followed this process on multiple switches.

If you *can* disassemble your keyboard and clean it properly (with isopropyl alcohol and then relubricating switches), do that instead! This is a guide of last resort. It may damage your keyboard permanently. You have been warned.

Required tools:

- Alklanet all-purpose cleaner (note: *not* substitutable by any other cleaning agent, it has specific desirable properties!)
- Paper towel
- Key puller

Process:

1. Turn off and disconnect your keyboard
2. Remove keycap carefully with key puller (if you have a Glove80, follow their *specific* instructions for removing caps without damage)
3. Spray some Alklanet on the paper towel - *not* on the switch itself!
4. Press against the front of the switch, where there is a 'rail' indent that guides the stem, causing a droplet of Alklanet to seep into the switch. It should only be a tiny amount, just enough to seep down!
5. Rapidly press and release the switch many times, you should start seeing the liquid slightly spread inside of the switch
6. Blow strongly into the switch for a while, using compressed air of some kind if possible, to accelerate the drying process
7. Verify that if you press the switch, you can no longer see liquid moving or air bubbles forming inside (ie. it is fully dry)
8. Done!

The reason this works: Alklanet is good at dissolving organics, including sugary drinks, but quite bad (though not entirely ineffective) at degreasing. This means that it will primarily dissolve and remove the sticky spill, without affecting the lubrication much. Because Alklanet evaporates quickly, it leaves very little, if any, residue behind, limiting the risk of shorted contacts.

If your switch does not register reliably after this process, it has not been fully cleaned - do it again. If your switch registers double presses, it has not dried sufficiently; immediately unplug and power off, and let it dry for longer. If both happen, it has not been sufficiently cleaned either.

If Alklanet is not available where you are, you may try to acquire a different cleaning agent that quickly dissolves into the air, leaves behind no residue, and that affects organic substances but not grease. Commercial window cleaners are your best bet, but this is entirely at your own risk, and you should be *certain* that it has these properties - labels are often misleading.

# Hacking the Areson L103G

<p class="callout info">This article was originally published many years ago at [http://cryto.net/~joepie91/areson/](http://cryto.net/~joepie91/areson/), when I just started digging into supply chains more. The contents have not been checked for accuracy since!</p>

![The Medion MD 86079.](https://wiki.slightly.tech/uploads/images/gallery/2024-12/HqxQyb1Wr838P3Qt-medion-md-86079.jpg)

<sup>(it's called the Areson G3 according to the USB identification data, though.)</sup>

Several years ago, I bought a Medion laser gaming mouse, the MD 86079, at the local Aldi. I'd been using it quite happily for years, and was always amazed at the comfort and accuracy of it, especially on glossy surfaces. Recently, I recommended it to somebody else, only to find that Medion had stopped selling it and it wasn't available anywhere else, either.

So I started researching, to figure out who actually *made* these things - after all, Medion barely does any manufacturing themselves. I started out by searching for it on Google, but this failed to bring up any useful results. The next step was to check the manual - but that didn't turn up anything useful either. Then I got an idea - what if I looked for clues in the driver software?

[![Huh?](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/YeL5aeZhwRwGv64o-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/YeL5aeZhwRwGv64o-image.png)

Certainly interesting, but not quite what I was looking for...

[![A hex dump of the relevant portion of the driver software.](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/EVhpsd4HtkP8FsvS-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/EVhpsd4HtkP8FsvS-image.png)

**Bingo!** The manufacturer turned out to be [Areson](http://www.areson.com.tw/).

Some further searching on the manufacturer name led me to believe that the particular model was the [L103G](https://web.archive.org/web/20140809171051/http://www.taiwan-technology.com/areson/?s=sr), since the exterior shape matched that of my mouse. However, when I searched for this model number a bit more... I started running across the [Mtek Gaming Xtreme L103G](https://www.youtube.com/watch?v=0Z4sIQfsBqE) and the [Bross L103G](http://www.gold.com.tr/bross-l103g-usb-lazer-siyah-kirmizi_u). And, more interestingly, an *earlier* version of my mouse from Medion that was branded the "[USB Mouse Areson L103G](http://www.medion.com/gb/service/start/_product.php?msn=20040882&gid=6)"! Apparently it had been staring me in the face for a while, and I failed to notice it.

Either way, the various other mice with the exact same build piqued my interest, and I started looking for other Areson L103G variants. And oh man, there were many. It started out with the Cyber Snipa mice, but as it turns out, Areson builds OEM mice for a large array of brands. Even the OCZ Dominatrix is actually just an Areson L103G! I've made a list of L103G variants and some other Areson-manufactured mice [down this page](http://cryto.net/~joepie91/areson/#models).

Another thing I noticed was that all these L103G variants advertised configurable macro keys and DPI settings, sometimes up to 5000 DPI, while my mouse was advertised as hard-set 1600 DPI with just an auto-fire button, and only came with driver software that let me remap the navigational buttons.

Surely if these mice are all the same model, they would have the same chipset and thus the same capabilities? I also wondered why my DPI switching and autofire (macro) buttons didn't work under Linux - if these mice are programmable, then surely this functionality is handled by the mouse chipset and not by the driver?

It was time to fire up a Windows XP VM.

After some mucking around with VirtualBox to get USB passthrough to work (hey, openSUSE packagers, you should probably *document* that you've [disabled that by default for security reasons](https://bugzilla.novell.com/show_bug.cgi?id=664520)!), I installed the original driver software for my Medion mouse. Apparently it's not even really a kernel driver - it seems to just be a piece of userspace software that sends signals to the device.

Sure enough, when I installed the driver, then disabled the USB passthrough, and thereby returned the device to the host (Linux) OS... the DPI switcher and macro button still worked fine, despite there being no driver to talk to anymore.

So, what was going on here?

My initial guess was that the mouse initially acts as a 'dumb' preconfigured USB/2.0 mouse, in order to have acceptable behaviour and DPI on a driver-less system, and that it would only enable the 'advanced features' (macros, DPI switching) if it got a signal from the driver saying that the configuration software was present. Now of course this makes sense for a highly configurable gaming mouse, but as my mouse didn't come with such software I found it a little odd.

So I fired up SnoopyPro, and had a look at the interaction that took place. Compared to a 'regular' 5 euro optical USB mouse - which I always have lying around as a spare - I noticed that two more interactions took place:

[![USB protocol dump, part 1.](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/jPX0v5VXDVRNosFn-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/jPX0v5VXDVRNosFn-image.png)

[![USB protocol dump, part 2.](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/DXAi5BCY7wWpIODq-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/DXAi5BCY7wWpIODq-image.png)

I haven't gotten around to looking at this in more detail yet (more to come!), but to me, that looks like it registers an extra non-standard configuration interface. Presumably, that interface is used for configuring the DPI and macros, and I suspect that the registration of it triggers enabling the DPI and macro buttons on the device.

USB protocol stuff aside, I wondered - is the hardware in my mouse *really* the same as that in the other models? And could I (ab)use that fact to configure my mouse beyond its advertised DPI?

[![The Trust GXT 33 control panel.](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/DytgU9tSmOObFJwV-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/DytgU9tSmOObFJwV-image.png)

As it turns out, *yes, I can!*

The Trust GXT 33 is another Areson L103G model, advertised as configurable up to 5000 DPI. Its 'driver' software happily lets me configure my mouse up to those 5000 DPI - even though my Medion mouse was only advertised as 1600 DPI! I've changed the configuration (as you can see in the screenshot), and it really *does* take effect. It even keeps working after detaching it from the USB passthrough and thus returning it to Linux. And it doesn't stop there...

[![The Trust GXT 33 control panel, macro panel.](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/YSNLFgCMQN8zOouj-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/YSNLFgCMQN8zOouj-image.png)

I can even configure macros for it. The interface isn't the most pleasant, but it works. And apparently, I now have some 5.7 KB of free storage space! I wonder if you could store arbitrary data in there...

Either way, back to the L103G. There is a quite wide array of variants of it, and I've made a list below for your perusal. Most of these are not sold anymore, but the Trust GXT 33 is - if it's sold near you (or any of the other L103G models are), I'd definitely recommend picking one up :)

A sidenote: some places reported particular mice (such as the Mtek L103G) as having a 1600 DPI sensor that can interpolate up to 3200 DPI with accuracy loss. However, even when cranking up mine to 5000 DPI, I did not notice any loss in quality - it is therefore possible that there are *some* differences between the sensors in different models.

<a name="models"></a>

## The model list

Know of a model not listed here, or have a suggestion / correction / other addition? [E-mail me!](mailto:l103g@cryto.net)

<table class="models" id="bkmrk-medion-md-86079-medi"><tbody><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/w8gvOIikSFBT86gF-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/w8gvOIikSFBT86gF-image.png)

</td><td class="info">### Medion MD 86079

### Medion X81007

### Medion L103G

<dl><dt>**Advertised default DPI**</dt><dd>400 / 800 / 1200 / 1600</dd><dt>**Advertised maximum configurable DPI**</dt><dd>N/A, advertised as hard-set resolution. Native sensor resolution unclear.</dd><dt>**Actual maximum configurable DPI**</dt><dd>5000 DPI</dd><dt>**Advertised macro features**</dt><dd>Hard-set, macro key enables auto-fire.</dd><dt>**Actual macro features**</dt><dd>Freely configurable mouse/keyboard macros, 5888 bytes internal storage space.</dd><dt>**Sold at...**</dt><dd class="unavailable">No longer available.</dd></dl></td></tr><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/m8R67BnNdSgQwxXb-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/m8R67BnNdSgQwxXb-image.png)

</td><td class="info">### Bross L103G

<dl><dt>**Advertised default DPI**</dt><dd>Not listed.</dd><dt>**Advertised maximum configurable DPI**</dt><dd>400 - 3200 DPI</dd><dt>**Actual maximum configurable DPI**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Advertised macro features**</dt><dd>Freely configurable mouse/keyboard macros.</dd><dt>**Actual macro features**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Notes**</dt><dd>Sold primarily in Turkey.</dd><dt>**Sold at...**</dt><dd class="unavailable">No longer available.</dd></dl></td></tr><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/qvXS6RHSjeGlzrje-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/qvXS6RHSjeGlzrje-image.png)

</td><td class="info">### Cyber Snipa Stinger

<dl><dt>**Advertised default DPI**</dt><dd>Not listed.</dd><dt>**Advertised maximum configurable DPI**</dt><dd>400 - 3200 DPI</dd><dt>**Actual maximum configurable DPI**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Advertised macro features**</dt><dd>Freely configurable mouse/keyboard macros, 3 profiles with 6 each.</dd><dt>**Actual macro features**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Notes**</dt><dd>Company (Cyber Snipa) appears to have gone defunct. E-mail bounces, Twitter compromised, most of their site broken.</dd><dt>**Sold at...**</dt><dd class="unavailable">No longer available.</dd></dl></td></tr><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/eEExDdMcJwrim0qH-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/eEExDdMcJwrim0qH-image.png)

</td><td class="info">### Mtek Gaming Extreme L103G

<dl><dt>**Advertised default DPI**</dt><dd>400 / 800 / 1600 / 2000</dd><dt>**Advertised maximum configurable DPI**</dt><dd>400 - 3200 DPI</dd><dt>**Actual maximum configurable DPI**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Advertised macro features**</dt><dd>Freely configurable mouse/keyboard macros.</dd><dt>**Actual macro features**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Notes**</dt><dd>Sold primarily in Brazil.</dd><dt>**Sold at...**</dt><dd class="unavailable">No longer available.</dd></dl></td></tr><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/nHiTRhntLkA521Du-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/nHiTRhntLkA521Du-image.png)

</td><td class="info">### Trust GXT 33

<dl><dt>**Advertised default DPI**</dt><dd>450 / 900 / 1800 / 3600</dd><dt>**Advertised maximum configurable DPI**</dt><dd>3600 DPI native</dd><dt>**Actual maximum configurable DPI**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Advertised macro features**</dt><dd>Freely configurable mouse/keyboard macros.</dd><dt>**Actual macro features**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Notes**</dt><dd>Software for this mouse let me configure my Medion mouse to 5000 DPI. Not sure if also possible for the GXT 33 itself, or whether interpolation is involved.</dd><dt>**Sold at...**</dt><dd>[Physical stores](http://www.trust.com/en/where-to-buy), [online Dutch shops (from €35)](http://tweakers.net/pricewatch/303686/trust-gxt-33-laser-gaming-mouse.html), [Amazon (from $61.88)](http://www.amazon.com/Trust-GXT-33-Laser-Gaming-Mouse/dp/B006TA0MEO/ref=sr_1_1?ie=UTF8&qid=1407609988&sr=8-1&keywords=trust+gxt+33).</dd></dl></td></tr><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/RnhnDtijHAwnzQ8d-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/RnhnDtijHAwnzQ8d-image.png)

</td><td class="info">### MSI StarMouse GS501

<dl><dt>**Advertised default DPI**</dt><dd>400 / 800 / 1600 / 2400</dd><dt>**Advertised maximum configurable DPI**</dt><dd>1600 DPI native</dd><dt>**Actual maximum configurable DPI**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Advertised macro features**</dt><dd>Freely configurable mouse/keyboard macros. Macro key acts as mode/profile switch button. Two programmable buttons.</dd><dt>**Actual macro features**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Notes**</dt><dd>Slightly different shell design.</dd><dt>**Sold at...**</dt><dd class="unavailable">No longer available.</dd></dl></td></tr><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/owHUzbrMdVe7wjtY-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/owHUzbrMdVe7wjtY-image.png)

</td><td class="info">### OCZ Dominatrix

<dl><dt>**Advertised default DPI**</dt><dd>400 / 800 / 1600 / 2000</dd><dt>**Advertised maximum configurable DPI**</dt><dd>3200 DPI, unclear if native or interpolated</dd><dt>**Actual maximum configurable DPI**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Advertised macro features**</dt><dd>Freely configurable mouse/keyboard macros.</dd><dt>**Actual macro features**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Notes**</dt><dd>Slightly different shell; not a single-piece cover, and differently shaped DPI / macro keys. Possibly more customized.</dd><dt>**Sold at...**</dt><dd class="unavailable">No longer available.</dd></dl></td></tr><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/xdPkydrSUjKY1Wbl-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/xdPkydrSUjKY1Wbl-image.png)

</td><td class="info">### Revoltec FightMouse Pro

<dl><dt>**Advertised default DPI**</dt><dd>400 / 800 / 1600 / 2000</dd><dt>**Advertised maximum configurable DPI**</dt><dd>3200 DPI native</dd><dt>**Actual maximum configurable DPI**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Advertised macro features**</dt><dd>Freely configurable mouse/keyboard macros.</dd><dt>**Actual macro features**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Notes**</dt><dd>Same shell layout as the OCZ Dominatrix, but with carbon print.</dd><dt>**Sold at...**</dt><dd><span class="important">No longer manufactured.</span>  
[Azerty (NL, €41,02)](http://azerty.nl/producten/product_detail/1071/213601/revoltec-fightmouse-pro-muis-laser-met-bekabeling-usb.html?channel_code=57), [Amazon UK (£39.59)](http://www.amazon.co.uk/Revoltec-Fightmouse-RE121-Mouse-4-ways/dp/B001PQCQ14)</dd></dl></td></tr></tbody></table>

## Earlier/simpler models (no macros, etc.)

<table class="models" id="bkmrk-gigabyte-gm-m6800-ad"><tbody><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/PXq9eQBj85JdpSpq-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/PXq9eQBj85JdpSpq-image.png)

</td><td class="info">### Gigabyte GM-M6800

<dl><dt>**Advertised default DPI**</dt><dd>800 / 1600</dd><dt>**Advertised maximum configurable DPI**</dt><dd>Advertised as hard-set.</dd><dt>**Actual maximum configurable DPI**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Advertised macro features**</dt><dd>None. No physical macro button either.</dd><dt>**Actual macro features**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Notes**</dt><dd><span class="important">This is an optical mouse, not a laser mouse!</span> This appears to be a custom (cheaper) optical build. No LED illumination, no side-scroll, and no weight adjustment.</dd><dt>**Sold at...**</dt><dd>[Online Dutch shops (from €14,39)](http://tweakers.net/pricewatch/249763/gigabyte-gm-m6800.html), [Amazon (from $9.99)](http://www.amazon.com/Gigabyte-High-Definition-Optical-Tracking-GM-M6800/dp/B0083EZE4M/ref=sr_1_1?ie=UTF8&qid=1407610430&sr=8-1&keywords=gigabyte+gm+m6800).</dd></dl></td></tr><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/b7vB0RTgUfbgiICW-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/b7vB0RTgUfbgiICW-image.png)

</td><td class="info">### Gigabyte GM-M6880

<dl><dt>**Advertised default DPI**</dt><dd>400 / 800 / 1600 (version 1 only supports 800 / 1600)</dd><dt>**Advertised maximum configurable DPI**</dt><dd>Advertised as hard-set.</dd><dt>**Actual maximum configurable DPI**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Advertised macro features**</dt><dd>None. No physical macro button either.</dd><dt>**Actual macro features**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Notes**</dt><dd>The same as the Gigabyte GM-M6800, but with a laser sensor and a differently colored shell. This appears to be a custom (cheaper) build. No LED illumination, no side-scroll, and no weight adjustment.</dd><dt>**Sold at...**</dt><dd>[Online Dutch shops (from €12,77)](http://tweakers.net/pricewatch/262489/gigabyte-gm-m6880.html), [Amazon (from $19.67)](http://www.amazon.com/Gigabyte-GM-M6880-Optical-Gaming-Mouse/dp/B002050WSS/ref=sr_1_1?ie=UTF8&qid=1407610598&sr=8-1&keywords=gigabyte+GM-M6880).</dd></dl></td></tr><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/uKm4IepIJyUqmyQ4-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/uKm4IepIJyUqmyQ4-image.png)

</td><td class="info">### PureTrak Valor

<dl><dt>**Advertised default DPI**</dt><dd>800 / 1600 / 2400 / 3500</dd><dt>**Advertised maximum configurable DPI**</dt><dd>3500 DPI native</dd><dt>**Actual maximum configurable DPI**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Advertised macro features**</dt><dd>None. No physical macro button either.</dd><dt>**Actual macro features**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Notes**</dt><dd><span class="important">This is an optical mouse, not a laser mouse!</span> Similar shell to the OCZ Dominatrix and Revoltec FightMouse Pro, but without macro key. Has the same weight adjustment system as the standard L103G.</dd><dt>**Sold at...**</dt><dd>[Online Dutch shops (from €24,90)](http://tweakers.net/pricewatch/unsorted/1099730/puretrak-valor-optical-gaming-mouse-(bnpuretrakmouse).html), [Amazon (from $19.95)](http://www.amazon.com/PureTrak-3500dpi-Professional-PC-Mac-Linux/dp/B00FP1XBIM/ref=sr_1_2?ie=UTF8&qid=1407613416&sr=8-2&keywords=puretrak+valor).</dd></dl></td></tr><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/iFupDrMCWrRxfBxB-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/iFupDrMCWrRxfBxB-image.png)

</td><td class="info">### Sentey Whirlwind X

<dl><dt>**Advertised default DPI**</dt><dd>400 / 800 / 1600 / 3200</dd><dt>**Advertised maximum configurable DPI**</dt><dd>3200 DPI native</dd><dt>**Actual maximum configurable DPI**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Advertised macro features**</dt><dd>None. No physical macro button either.</dd><dt>**Actual macro features**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Notes**</dt><dd><span class="important">This is an optical mouse, not a laser mouse!</span> Similar shell to the PureTrak Valor, but no weight adjustment. Also no macro key. Pixart PAW-3305 chipset, rather than the AVAGO ADNS series that is common in this type of mouse.</dd><dt>**Sold at...**</dt><dd>[Many stores (from $29.99)](http://www.sentey.com/en/wheretobuy), [Amazon (sale $9.99, regular $34.99)](http://www.amazon.com/Gaming-Mouse-Wired-Sentey%C2%AE-Promotion/dp/B00I5TZOB8/ref=sr_1_1?ie=UTF8&qid=1407613845&sr=8-1&keywords=sentey+whirlwind).</dd></dl></td></tr><tr><td>[![image.png](https://wiki.slightly.tech/uploads/images/gallery/2024-12/scaled-1680-/YNEV18f3gA10JptT-image.png)](https://wiki.slightly.tech/uploads/images/gallery/2024-12/YNEV18f3gA10JptT-image.png)

</td><td class="info">### CANYON CNR-MSG01

<dl><dt>**Advertised default DPI**</dt><dd>400 / 800 / 1600 / 2400 (?)</dd><dt>**Advertised maximum configurable DPI**</dt><dd>3200 DPI (possibly interpolated)</dd><dt>**Actual maximum configurable DPI**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Advertised macro features**</dt><dd>None. No physical macro button either.</dd><dt>**Actual macro features**</dt><dd>Not tested. [Send me feedback!](mailto:l103g@cryto.net)</dd><dt>**Notes**</dt><dd><span class="important">This is an optical mouse, not a laser mouse!</span> Similar shell to the regular L103G, but without macro key. No weight adjustment, likely no sidescroll either.</dd><dt>**Sold at...**</dt><dd class="unavailable">No longer available.</dd></dl></td></tr></tbody></table>

# Server administration

General Linux server management notes, not specific to anything in particular.

# Batch-migrating Gitolite repositories to Gogs

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/2ff74545f079352c740a](https://gist.github.com/joepie91/2ff74545f079352c740a). </p>

<div class="Box-body readme blob p-5 p-xl-6 gist-border-0" id="bkmrk-note%3A-this-will-only"><article class="markdown-body entry-content container-lg">**NOTE:** This will only work if you are an administrator on your Gogs instance, or if an administrator has enabled local repository importing for all users.

First, save the following as `migrate.sh` somewhere, and make it executable (`chmod +x migrate.sh`):

```bash
#!/usr/bin/env bash

HOSTNAME="git.cryto.net"
BASEPATH="/home/git/old-repositories/projects/joepie91"

OWNER_ID="$1"
# Extract the CSRF token from the saved browser cookies (tab-delimited Netscape format)
CSRF=$(grep _csrf ./cookies.txt | cut -f 7)

# Read repository names from stdin, one per line
while read -r REPO; do
	REPONAME=$(echo "$REPO" | sed "s/\.git\$//")
	curl "https://$HOSTNAME/repo/migrate" \
		-b "./cookies.txt" \
		-H 'origin: null' \
		-H 'content-type: application/x-www-form-urlencoded' \
		-H "authority: $HOSTNAME" \
		--data "_csrf=$CSRF" \
		--data-urlencode "clone_addr=$BASEPATH/$REPO" \
		--data-urlencode "uid=$OWNER_ID" \
		--data-urlencode "auth_username=" \
		--data-urlencode "auth_password=" \
		--data-urlencode "repo_name=$REPONAME" \
		--data-urlencode "description=Automatically migrated from Gitolite"
done
```

Change `HOSTNAME` to point at your Gogs installation, and `BASEPATH` to point at the folder where your Gitolite repositories live on the filesystem. It must be the entire base path - the repository names cannot contain slashes!

Now save the Gogs cookies from your browser as `cookies.txt`, and create a file (eg. `repositories.txt`) containing all your repository names, each on a new line. It could look something like this:

```
project1.git
project2.git
project3.git
```
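
If your repositories all live directly under `BASEPATH`, you can also generate this file from the filesystem instead of writing it by hand - a quick sketch (the path below matches the example `BASEPATH`; adjust it to your own setup):

```shell
# List all bare repository directories and write their names to repositories.txt
ls /home/git/old-repositories/projects/joepie91 | grep '\.git$' > repositories.txt
```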

After that, run the following command:

```bash
cat repositories.txt | ./migrate.sh 1
```

... where you replace `1` with your User ID on your Gogs instance.

Done!

</article></div>

# What is(n't) Docker actually for?

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/1427c8fb172e07251a4bbc1974cdb9cd](https://gist.github.com/joepie91/1427c8fb172e07251a4bbc1974cdb9cd).</p>

<p class="callout warning">This article was written in 2016. Some details may have changed since.</p>

A brief listing of some misconceptions about the purpose of Docker.

### Secure isolation

Some people try to use Docker as a 'containment system' for either:

- Untrusted user-submitted code, or
- Compromised applications

... but Docker explicitly [does not provide that kind of functionality](https://news.ycombinator.com/item?id=7910117). You get essentially the same level of security from just running things under a user account.

If you want secure isolation, either use a full virtualization technology (Xen HVM, QEMU/KVM, VMWare, ...), or a containerization/paravirtualization technology that's explicitly designed to provide secure isolation (OpenVZ, Xen PV, [*unprivileged* LXC](https://www.stgraber.org/2014/01/17/lxc-1-0-unprivileged-containers/), ...)

### "Runs everywhere"[<svg aria-hidden="true" class="octicon octicon-link" height="16" version="1.1" viewbox="0 0 16 16" width="16"></svg>](https://gist.github.com/joepie91/1427c8fb172e07251a4bbc1974cdb9cd#runs-everywhere)

Absolutely false. Docker will not run (well) on:

- Old kernels
- OpenVZ
- Non-\*nix systems (without additional virtualization that you could do yourself anyway)
- Many other containerized/paravirtualized environments
- Exotic architectures like MIPS

Docker is *just* a containerization system. It doesn't do magic. And due to environmental limitations, chances are that using Docker will actually make your application run in *fewer* environments.

### No dependency conflicts

Sort of true, but misleading. There are *many* solutions to this, and in many cases it isn't even a realistic problem.

- **Compiled languages:** Just compile your binary statically. Same library overhead as when using Docker, less management overhead.
- **Node.js:** Completely unnecessary. Dependencies are *already* local to the project. For different Node.js versions (although you generally shouldn't need this due to LTS schedules and polyfills), use [nvm](https://github.com/creationix/nvm/blob/master/README.markdown).
- **Python:** [virtualenv](https://virtualenv.pypa.io/en/stable/) and [pyenv](https://github.com/yyuu/pyenv).
- **Ruby:** This one might actually be a valid reason to use *some* kind of containerization system. Supposedly tools like `rvm` exist but frankly I've never seen them work well. Even then, Docker is probably not the ideal option (see below).
- **External dependencies and other stuff:** Usually, isolation isn't necessary, as these applications tend to have extremely lengthy backwards compatibility, so you can just run a recent version.

If you *do* need to isolate something and the above either doesn't suffice or it doesn't integrate with your management flow well enough, you should rather look at something like [Nix/NixOS](https://nixos.org/), which solves the dependency isolation problem in a *much* more robust and efficient way, and also [solves the problem of state](http://gfxmonk.net/2015/01/03/nixos-and-stateless-deployment.html). It does incur management overhead, like Docker would.

### Magic scalability

First of all: you probably don't *need* any of this. 99.99% of projects will never have to scale beyond a single system, and all you'll be doing is adding management overhead and moving parts that can break, to solve a problem you never had to begin with.

If you *do* need to scale beyond a single system, even if that needs to be done rapidly, you probably *still* don't get a big benefit from automated orchestration. You set up each server once, and assuming you run the same OS/distro on each system, the updating process will be basically the same for every system. It'll likely take you more time to set up and manage automated orchestration, than it would to just do it manually when needed.

The only usecase where automated orchestration *really* shines, is in cases where you have high *variance* in the amount of infrastructure you need - one day you need a single server, the next day you need ten, and yet another day later it's back down to five. There are extremely few applications that fall into this category, but even if your application does - there have been automated orchestration systems for a long time (Puppet, Chef, Ansible, ...) that don't introduce the kind of limitations or overhead that Docker does.

### No need to rely on a sysadmin

False. Docker is not your system administrator, and you still need to understand what the moving parts are, and how they interact together. Docker is *just a container system*, and putting an application in a container doesn't somehow magically absolve you from having to have somebody manage your systems.

# Blocking LLM scrapers on Alibaba Cloud from your nginx configuration

There are currently LLM scrapers running off many Alibaba Cloud IPs that ignore `robots.txt` and pretend to be desktop browsers. They also generate absurd request rates, to the point of being basically a DDoS attack. One way to deal with them is to simply block *all* of Alibaba Cloud.

<p class="callout warning">This will also block legitimate users of Alibaba Cloud!</p>

Here's how you can block them:

1. Generate a deny entry list at [https://www.enjen.net/asn-blocklist/index.php?asn=45102&amp;type=nginx](https://www.enjen.net/asn-blocklist/index.php?asn=45102&type=nginx)
2. Add the entries to your nginx configuration. It goes directly in the `server { ... }` block.
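
The generated list is just a series of `deny` directives, one per network range. It would look something like this (illustrative ranges only - use the generator above for the current list):

```nginx
# inside the server { ... } block
deny 47.74.0.0/15;
deny 47.76.0.0/14;
```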

### On NixOS

If you're using Nix or NixOS, you can keep the deny list in a separate file, which makes it easier to maintain and won't clutter up your nginx configuration as much. It would look something like this:

```
services.nginx.virtualHosts.<name>.extraConfig = ''
  ${import ./alibaba-blocklist.nix}
  # other config goes here
'';
```

... where you replace `<name>` with your hostname.
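
For illustration, `alibaba-blocklist.nix` would then simply be a Nix file that evaluates to a string of deny rules (the ranges below are placeholders - generate the real list as described above):

```nix
# alibaba-blocklist.nix
''
  deny 47.74.0.0/15;
  deny 47.76.0.0/14;
''
```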

# Dealing with a degraded btrfs array due to disk failure

Forcing a btrfs filesystem to be mounted even though some drives are missing (in a default multi-disk setup, ie. RAID0 for data but RAID1 for metadata):

```
mount -o degraded,ro /path/to/mount
```

This assumes that the mounting configuration is defined in your `fstab`, and will mount it as read-only in a degraded state. You will be able to browse the filesystem, but any file contents may have unexplained gaps and/or be corrupted. Mostly useful to figure out what data used to be on a degraded filesystem.

<p class="callout warning">Never mount a degraded filesystem as read-write unless you have a very specific reason to need it, and you understand the risks. If applications are allowed to write to it, they can very easily make the data corruption worse, and reduce your chances of data recovery to zero!</p>

# Privacy

# Don't use VPN services.

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/5a9909939e6ce7d09e29](https://gist.github.com/joepie91/5a9909939e6ce7d09e29).</p>

No, seriously, don't. You're probably reading this because you've asked what VPN service to use, and this is the answer.

<p class="callout success">**Note:** The content in this post does not apply to using VPN for their intended purpose; that is, as a virtual private (internal) network. It only applies to using it as a glorified proxy, which is what every third-party "VPN provider" does.</p>

- A Russian translation of this article can be found [here](https://tdemin.github.io/posts/2017-08-13-dont-use-vpn-services_ru), contributed by Timur Demin.
- A Turkish translation can be found [here](https://write.as/nwz9t04yfjwlv0yj.md), contributed by agyild.
- There's also [this article](https://schub.io/blog/2019/04/08/very-precarious-narrative.html) about VPN services, which is honestly better written (and has more cat pictures!) than my article.

### Why not?

Because a VPN in this sense is just a glorified proxy. The VPN provider can see all your traffic, and do with it what they want - including logging.

### But my provider doesn't log!

There is no way for you to verify that, and of course this is what a malicious VPN provider would claim as well. In short: the only safe assumption is that *every* VPN provider logs.

And remember that it is in a VPN provider's best interest to log their users - it lets them deflect blame to the customer, if they ever were to get into legal trouble. The $10/month that you're paying for your VPN service doesn't even pay for the lawyer's *coffee*, so expect them to hand you over.

### But a provider would lose business if they did that!

I'll believe that when HideMyAss goes out of business. They gave up their users years ago, and [this was widely publicized](http://www.theregister.co.uk/2011/09/26/hidemyass_lulzsec_controversy/). The reality is that most of their customers will either not care or not even be aware of it.

### But I pay anonymously, using Bitcoin/PaysafeCard/Cash/drugs!

Doesn't matter. You're still connecting to their service from your own IP, and they can log that.

### But I want more security!

VPNs don't provide security. They are just a glorified proxy.

### But I want more privacy!

VPNs don't provide privacy, with a few exceptions (detailed below). They are just a proxy. If somebody wants to tap your connection, they can still do so - they just have to do so at a different point (ie. when your traffic leaves the VPN server).

### But I want more encryption!

Use SSL/TLS and HTTPS (for centralized services), or end-to-end encryption (for social or P2P applications). VPNs can't magically encrypt your traffic - it's simply not technically possible. If the endpoint expects plaintext, there is *nothing* you can do about that.

When using a VPN, the *only* encrypted part of the connection is from you to the VPN provider. From the VPN provider onwards, it is the same as it would have been without a VPN. And remember, **the VPN provider can see and mess with all your traffic.**

### But I want to confuse trackers by sharing an IP address!

Your IP address is a largely irrelevant metric in modern tracking systems. Marketers have gotten wise to these kinds of tactics, and combined with increased adoption of [CGNAT](https://en.wikipedia.org/wiki/Carrier-grade_NAT) and an ever-increasing amount of devices per household, it just isn't a reliable data point anymore.

Marketers will almost always use some kind of other metric to identify and distinguish you. That can be anything from a useragent to a [fingerprinting profile](https://panopticlick.eff.org/). A VPN cannot prevent this.

### So when should I use a VPN?

There are roughly two usecases where you might want to use a VPN:

1. You are on a known-hostile network (eg. a public airport WiFi access point, or an ISP that is known to use MITM), and you want to work around that.
2. You want to hide your IP from a very specific set of non-government-sanctioned adversaries - for example, circumventing a ban in a chatroom or preventing anti-piracy scare letters.

In the second case, you'd probably just want a regular proxy *specifically* for that traffic - sending *all* of your traffic over a VPN provider (like is the default with almost every VPN client) will still result in the provider being able to snoop on and mess with your traffic.

However, in practice, **just don't use a VPN provider at all, even for these cases.**

### So, then... what?

If you absolutely need a VPN, and you understand what its limitations are, purchase a VPS and set up your own (either using something like [Streisand](https://github.com/StreisandEffect/streisand) or manually - I recommend using WireGuard). I will not recommend any specific providers (diversity is good!), but there are plenty of cheap ones to be found on [LowEndTalk](https://www.lowendtalk.com/categories/offers).
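
For reference, a minimal WireGuard server config on such a VPS might look like this. All values below are placeholders - generate real keys with `wg genkey` and `wg pubkey`, and pick your own internal addresses:

```ini
# /etc/wireguard/wg0.conf on the VPS
[Interface]
Address = 10.0.0.1/24
ListenPort = 51820
PrivateKey = <server-private-key>

[Peer]
# one [Peer] section per client device
PublicKey = <client-public-key>
AllowedIPs = 10.0.0.2/32
```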

### But how is that any better than a VPN service?

A VPN provider *specifically seeks out* those who are looking for privacy, and who may thus have interesting traffic. Statistically speaking, it is more likely that a VPN provider will be malicious or a honeypot, than that an arbitrary generic VPS provider will be.

### So why do VPN services exist? Surely they must serve some purpose?

Because it's easy money. You just set up OpenVPN on a few servers, and essentially start reselling bandwidth with a markup. You can make every promise in the world, because nobody can verify them. You don't even have to know what you're doing, because again, nobody can verify what you say. It is 100% snake-oil.

So yes, VPN services do serve a purpose - it's just one that benefits the provider, not you.

---

**This post is licensed under the [WTFPL](http://cryto.net/~joepie91/blog/LICENSE.txt) or [CC0](https://creativecommons.org/publicdomain/zero/1.0/), at your choice.** You may distribute, use, modify, translate, and license it in any way.

# Normies just don't care about privacy

If you're a privacy enthusiast, you probably clicked a link to this post thinking it's going to vindicate you; that it's going to prove how you've been right all along, and "normies just don't care about privacy", despite your best efforts to make them care. That it's going to show how you're smarter, because you understand the threats to privacy and how to fight them.

Unfortunately, **you're not right**. You never were. Let's talk about why, and what you should do next.

So, first of all, let's dispense with the "normie" term. It's a pejorative term, a name to call someone when they don't have your exact set of skills and interests, a term to use when you want to imply that someone is clueless or otherwise below you. There's no good reason to use it, and it suggests that you're looking down on them. Just call them "people", like everybody else and like yourself - you don't need to turn them into a group of "others" to begin with.

Why does that matter? Well, would *you* take advice from someone who looks down on you? You probably wouldn't. Talking about "normies" pretty much sets the tone for a conversation; it means that you don't care about someone else's interests or circumstances, that you won't treat them like a full human being of equal value to yourself. In other words, you're being an arrogant asshole. And no one likes arrogant assholes.

And this is also exactly why you think that they "just don't care about privacy". They might have even explicitly told you that they don't! So then it's clear, right? If they say they don't care about privacy, that must mean that they don't care about privacy, otherwise they wouldn't say that!

Unfortunately, that's not how it works. Most likely, the reason they told you that they "don't care" is to **make you go away**. Most likely, you've been quite pushy, telling them what they should be doing or using instead, and responding to every counterpoint with an even stronger recommendation, maybe even trying to make them feel guilty about "not caring enough" just because they're not as enthusiastic about it as you are.

And how do you make an enthusiast like that go away? You cut off the conversation. You tell them that **you don't care**. You leave zero space for the enthusiast to wiggle their way back into the conversation, for them to try and continue arguing something that you've grown tired of. If you don't care, then there's nothing to argue *about*, and so that is what they tell you.

In reality, almost everybody *does* care about privacy. To different degrees, in different situations, and in different ways - but almost everybody cares. People lock the bathroom door; they use changing stalls; they don't like strangers shouldersurfing their phone screen; they hide letters and other things. Clearly people *do* care. They probably also know that Facebook and the like are pretty shitty, considering that media outlets have been reporting on it for a decade now. You don't need to tell them that.

So what *should* you do? It's easy for me to say "don't be pushy", but then how do you help people keep their communications private? How do you help advance the state of private communications in general?

The answer is to **understand, not argue**. Don't try to convince people, at least not directly. Don't tell them what to do, or what to use. Don't try to make them feel bad about using closed or privacy-unfriendly systems. Instead, *ask questions*. Try to understand their circumstances - who do they talk to, why do they need to use specific services? Does their employer require it? Are their friends refusing to move over to something without a specific feature?

Recognize and accept that **caring about privacy does not mean it needs to be your primary purpose in life**. Someone can simultaneously care about privacy, but also refuse to stop using Facebook because they care *more* about talking to a long-lost friend who is not reachable anywhere else. They can care about privacy, but care *more* about keeping their job which requires using Slack. They're not *enthusiasts*, and they shouldn't *need* to be to have privacy in their life - that's the whole point of the privacy movement, isn't it?

Finally, once you have asked enough questions - *without* being judgmental or considering answers 'wrong' in any way - you can build an understanding of someone's motivations and concerns and interests. You now have enough information to understand whether you can help them make their life more private *without* giving up on the things they care about.

Maybe they *really* want reactions in their messenger when talking to their friends, and just weren't aware that Matrix can do that, and that's what kept them on Discord. Maybe they've looked at Mastodon, but it looked like a ghost town to them, just because they didn't know about a good instance to join. But these are all things that you *can't know* until you've learned about someone's individual concerns and priorities. Things that you would never learn about to begin with, if they cut you off with "I don't care" because you're being pushy.

And maybe, the answer is that you can't do anything for them. Maybe, they just don't have any other options, and there are issues with all your alternative suggestions that would make them unworkable in their situation. Sometimes, the answer is just that something isn't good enough yet; and that you need to accept that, and put in the work to improve the tool instead of trying to convince people to use it as-is.

Don't be the insufferable privacy nut. Be the helpful, supportive and understanding friend who happens to know things about privacy.

# Security

The computer kind, mostly.

# Why you probably shouldn't use a wildcard certificate

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/7e5cad8c0726fd6a5e90360a754fc568](https://gist.github.com/joepie91/7e5cad8c0726fd6a5e90360a754fc568).</p>

Recently, Let's Encrypt [launched free wildcard certificates](https://community.letsencrypt.org/t/acme-v2-and-wildcard-certificate-support-is-live/55579). While this is good news in and of itself, as it removes one of the last remaining reasons for expensive commercial certificates, I've unfortunately seen a lot of people dangerously misunderstand what wildcard certificates are for.

Therefore, in this brief post I'll explain why **you probably shouldn't use a wildcard certificate, as it will put your security at risk.**

### A brief explainer

It's generally pretty poorly understood (and documented!) how TLS ("SSL") works, so let's go through a brief explanation of the parts that are important here.

The general (simplified) idea behind how real-world TLS deployments work is that you:

1. Generate a cryptographic keypair (private + public key)
2. Generate a 'certificate' from that (containing the public key + some metadata, such as your hostname and when the certificate will expire)
3. Send the certificate to a Certificate Authority (like Let's Encrypt), who will then validate the metadata - this is where it's ensured that you actually *own* the hostname you've created a certificate for, as the CA will check this.
4. Receive a *signed* certificate - the original certificate, plus a cryptographic signature proving that a given CA validated it
5. Serve up this signed certificate to your users' clients
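For the curious, this flow can be walked through with the widely available `openssl` command-line tool. Note that this is only an illustration: the last step *self-signs* the certificate (which no browser would trust), where in reality a CA like Let's Encrypt would validate and sign it for you.

```shell
# 1. Generate a keypair (the private key; the public key is derived from it)
openssl genpkey -algorithm RSA -pkeyopt rsa_keygen_bits:2048 -out key.pem

# 2. Generate a certificate signing request: the public key plus
#    metadata, here the hostname example.com
openssl req -new -key key.pem -subj "/CN=example.com" -out csr.pem

# 3+4. Normally a CA validates and signs the request; for illustration
#      only, self-sign it with our own key instead
openssl x509 -req -in csr.pem -signkey key.pem -days 30 -out cert.pem

# Inspect the resulting certificate's metadata
openssl x509 -in cert.pem -noout -subject -dates
```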

The client will then do the following:

1. Verify that the certificate was signed by a Certificate Authority that it trusts; the certificates (and thus the public keys) of all trusted CAs are already present on your system.
2. If it's valid, treat the public key included with the certificate as the legitimate server's public key, and use that key to encrypt the communication with the server

This description is somewhat simplified, and I don't want to go into too much detail as to *why* this is secure from many attacks, but the general idea is this: nobody can snoop on your traffic or impersonate your server, so long as 1) no Certificate Authorities have their own keys compromised, and 2) your keypair + signed certificate have not been leaked.

### So, what's a wildcard certificate *really?*

A typical TLS certificate will have an explicit hostname in its metadata; for example, Google might have a certificate for `mail.google.com`. That certificate is **only** valid on `https://mail.google.com/` - not on `https://google.com/`, not on `https://images.google.com/`, and not on `https://my.mail.google.com/` either. In other words, the hostname has to be an *exact match*. If you tried to use that certificate on `https://my.mail.google.com/`, you'd get a certificate error from your browser.

A wildcard certificate is different; as the name suggests, it uses a *wildcard* match rather than an exact match. You might have a certificate for `*.google.com`, and it would be valid on `https://mail.google.com/` *and* `https://images.google.com/` - but **still not** on `https://google.com/` or `https://my.mail.google.com/`. In other words, the asterisk can match any one single 'segment' of a hostname, but nothing with a full stop in it.
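This matching rule is easy to sketch in code. The function below is a simplified illustration, not what browsers actually run - real clients follow RFC 6125, which among other things only permits the wildcard in the leftmost label - but the "one label per asterisk" rule is the same:

```python
def matches_certificate(cert_host: str, request_host: str) -> bool:
    """Simplified TLS hostname matching: a '*' label matches exactly
    one DNS label; every other label must match exactly."""
    cert_labels = cert_host.lower().split(".")
    request_labels = request_host.lower().split(".")
    if len(cert_labels) != len(request_labels):
        return False  # a '*' never spans multiple labels
    for cert_label, request_label in zip(cert_labels, request_labels):
        if cert_label != "*" and cert_label != request_label:
            return False
    return True
```

With this, `matches_certificate("*.google.com", "mail.google.com")` holds, while both `"google.com"` (too few labels) and `"my.mail.google.com"` (too many) are rejected.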

There are some situations where this is very useful. Say that I run a website builder from a single server, and every user gets their own subdomain - for example, my website might be at `https://joepie91.somesitebuilder.com/`, whereas your website might be at `https://catdogcat.somesitebuilder.com/`.

It would be very impractical to have to request a new certificate for *every single user that signs up*; so, the easier option is to just request one for `*.somesitebuilder.com`, and now that single certificate works for *all* users' subdomains.

So far, so good.

### So, why can't I do this for *everything* with subdomains?

And this is where we run into trouble. Note how in the above example, all of the sites are hosted *on a single server*. If you run a larger website or organization with lots of subdomains that host different things - say, for example, Google with their `images.google.com` and `mail.google.com` - then these subdomains will probably be hosted on *multiple servers*.

And that's where the security of wildcard certificates breaks down.

Remember how one of the two requirements for TLS security is that "your keypair + signed certificate have not been leaked"? Sometimes certificates *do* leak - servers get hacked, for example.

When this happens, you'd want to *limit the damage* of the compromise - ideally, your certificate will expire pretty rapidly, and it doesn't affect anything other than the server that was compromised anyway. After fixing the issue, you then revoke the old compromised certificate, replace it with a new, non-compromised one, and all your other servers are unaffected.

In our single-server website builder example, this is not a problem. We have a single server, it got compromised, the stolen certificate only works for that one single server; we've limited the damage as much as possible.

But, consider the "multiple servers" scenario - maybe just the `images.google.com` server got hacked, and `mail.google.com` was unaffected. However, the certificate on `images.google.com` was a wildcard certificate for `*.google.com`, and now the thief can use it to impersonate the `mail.google.com` server and intercept people's e-mail traffic, even though the `mail.google.com` server was never hacked!

Even though originally only one server was compromised, we didn't correctly limit the damage, and now the e-mail server is at risk too. If we'd had two certificates, instead - one for `mail.google.com` and one for `images.google.com`, each of the servers only having access to their own certificate - then this would not have happened.

### The moral of the story

Each certificate should only be used for one server, or one homogeneous cluster of servers. Different services on different servers should have their own, usually non-wildcard certificates.

If you have a lot of hostnames pointing at the same service on the same server(s), then it's fine to use a wildcard certificate - so long as that wildcard certificate doesn't *also* cover hostnames pointing at *other* servers; otherwise, each service should have its own certificates.

If you have a few hostnames pointing at unique servers and everything else at one single service - eg. `login.mysite.com` and then a bunch of user-created sites - then you may want to put the wildcard-covered hostnames under their own prefix. For example, you might have one certificate for `login.mysite.com`, and one (wildcard) certificate for `*.users.mysite.com`.

In practice, you will *almost never* need wildcard certificates. It's nice that the option exists, but unless you're automatically generating subdomains for users, a wildcard certificate is probably an unnecessary and insecure option.

(To be clear: this is in no way specific to Let's Encrypt, it applies to wildcard certificates *in general*. But now that they're suddenly not expensive anymore, I think this problem requires a bit more attention.)

# The Fediverse and Mastodon

# The 5-minute guide to the fediverse and Mastodon

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/f924e846c24ec7ed82d6d554a7e7c9a8](https://gist.github.com/joepie91/f924e846c24ec7ed82d6d554a7e7c9a8).</p>

There are lots of guides explaining Mastodon and the broader fediverse, but they often go into way too much detail. So I've written this guide - it only talks about the basics you need to know to start using it, and you can then gradually learn the rest from other helpful fediverse users. Let's get started!

### The fediverse is not Twitter!

The fediverse is very different from Twitter, and that is by design. It's made for building close communities, not for building a "global town square" or as a megaphone for celebrities. That means many things will work differently from what you're used to. Give it some time, and ask around on the fediverse if you're not sure why something works how it does! People are usually happy to explain, as long as it's a genuine question. Some of the details are explained in [this article](https://scott.mn/2022/10/29/twitter_features_mastodon_is_better_without/), but it's not required reading.

The most important takeaway is the "community" part. Clout-chasing and dunking are strongly frowned upon in the fediverse. People expect you to talk to others like they're real human beings, and they will do the same for you.

### The fediverse is also not just Mastodon

"The fediverse" is a name for the thousands of servers that connect together to form a big "federated" social network. Every server is its own community with its own people and "vibe", but you can talk to people in other communities as well. Different servers also run different software with different features, and Mastodon is the most well-known option - but you can also talk to servers using *different* fediverse software, like Misskey.

### It doesn't matter what server you pick... mostly

Like I said, different servers have different communities. But don't get stuck on picking one - you can always move to a different server later, and your follows will move with you. Just pick the first server from [https://joinmastodon.org/servers](https://joinmastodon.org/servers) that looks good to you. In the long run, you'll probably want to use a smaller server with a closer community, but again it's okay if you start out on a big server first. Other people on your server can help you find a better option later on!

Also keep in mind that the fediverse is run by volunteers; if you run into issues with your server, then you can usually just [talk to the admin](https://friendica.keithhacks.cyou/display/d4b33f08-1363-59e3-4238-e93574728878) to get them resolved. It's not like a faceless corporation where you get bounced from department to department!

It's a good idea to avoid mastodon.social and mastodon.online - they have long-standing moderation issues, and are frequently overloaded.

### Content warnings and alt texts are important

There are two important parts of fediverse culture that you might not be used to: content warnings, and image alt texts. You should always give images a useful, descriptive alt text (though it doesn't have to be detailed!), so that the many blind and vision-impaired users in the fediverse can also understand them. Alt texts can also help people understand jokes that they otherwise wouldn't get. Many people will never "boost" (basically retweet) images that don't have an alt text.

Content warnings are a bit subtler, but also very important. There is a strong culture of using content warnings on the fediverse, and so when in doubt, you should err on the side of using them. Because they are so widespread, people are used to them - you don't need to worry that people won't read things behind a CW. CW rules vary across communities, but you should at least put a CW on posts about violence, politics, sexuality, heavy topics, meta stuff about Twitter or the fediverse, and anything that's currently a "hot topic" that everybody seems to be talking about.

This helps people keep control over what they see, and stops people from getting overwhelmed, like you've probably seen (or felt) happen a lot on Twitter. Replies automatically get the same CW, so it's pretty easy to use.

### Take your time

The fediverse isn't built around algorithmic feeds like Twitter is, so by default you won't really find much happening - what you see is entirely determined by who you follow, and it'll take some time to find people you like. This is normal! Things will get much more lively once you're following and interacting with a few people. Likewise, there's no "one big network" - you'll have a different 'view of the network' from every server, because communities tend to be tight-knit. This also means that it's difficult for unpleasant people to find you.

It's a good idea to make an introduction post, tagged with the `#introduction` hashtag, and hashtags for any of the other topics you're interested in. Posts on the fediverse can only be found by their hashtag, so they're important to use if you want people to find you. Likewise, you can search for hashtags to find interesting people.

That's pretty much it! You'll find many more useful tips on the fediverse itself, under the `#FediTips` hashtag. Take your time, explore, get used to how everything works, learn about the local culture, and ask for help in a post if you can't figure something out! There are many people who will be happy to help you out.

# Cryptocurrency

# No, your cryptocurrency cannot work

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/daa93b9686f554ac7097158383b97838](https://gist.github.com/joepie91/daa93b9686f554ac7097158383b97838).</p>

Whenever the topic of Bitcoin's energy usage comes up, there's always a flood of hastily-constructed comments by people claiming that *their* favourite cryptocurrency isn't like Bitcoin, that *their* favourite cryptocurrency is energy-efficient and scalable and whatnot.

They're wrong, and are quite possibly trying to scam you. Let's look at why.

### What *is* a cryptocurrency anyway?

There are plenty of intricate and complex articles trying to convince you that cryptocurrencies are the future. They usually heavily use jargon and vague terms, make vague promises, and generally give you a sense that there must be *something* there, but you always come away from them more confused than you were before.

That's not because you're not smart enough; that's because such articles are *intentionally* written to be confusing and complex, to create the impression of cryptocurrency being some revolutionary technology that you *must* invest in, while trying to obscure that it's all just smoke and mirrors and there's really not much to it.

So we're not going to do any of that. Let's look at what cryptocurrency *really is*, the fundamental concept, in simple terms.

A cryptocurrency, put simply, is a currency that is not controlled by an appointed organization like a central bank. Instead, it's a system that's built out of *technical rules*, code that can independently decide whether someone holds a certain amount of currency and whether a given transaction is valid. The rules are defined upfront and difficult for anybody to change afterwards, because some amount of 'consensus' (agreement) between the systems of different users is needed for that. You can think of it kind of like an automated voting process.

Basically, **a cryptocurrency is a currency that is built *as software***, and that software runs on many people's computers. On paper, this means that "nobody controls it", because everybody has to play by the predefined rules of the system. In practice, it's unfortunately not that simple, and cryptocurrencies end up being heavily centralized, as we'll get to later.

### So why does Bitcoin need so much energy?

The idea of a currency that can be entirely controlled by independent software *sounds* really cool, but there are some problems. For example, how do you prevent one person from convincing the software that they are actually a *million* different people, and misusing that to influence that consensus process? If you have a majority vote system, then you want to make really sure that everybody can only cast one vote, otherwise it would be really easy to tamper with the outcome.

Cryptocurrencies try to solve this using a 'proof scheme', and Bitcoin specifically uses what's called "proof of work". The idea is that there is a finite amount of computing power in the world, computing power is expensive, and so you can prevent someone from tampering with the 'vote' by requiring them to do some difficult computations. After all, computations can be automatically and independently checked, and so nobody can pretend to have more computing power than they really do. So that's the problem solved, right?

The underlying trick here is to make a 'vote' require the usage of something scarce, something relatively expensive, something that you can't just infinitely wish into existence, like you could do with digital identities. It makes it costly *in the real world* to participate in the network. That's the core concept behind a proof scheme, and it is *crucial* for the functioning of a cryptocurrency - without a proof scheme requiring a scarce resource of some sort, the network cannot protect itself and would be easy to tamper with, making it useless as a currency.
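As a toy illustration of the concept - this is *not* Bitcoin's actual algorithm, which double-SHA-256-hashes an 80-byte block header against an adjustable difficulty target - a proof-of-work puzzle can be as simple as:

```python
import hashlib

def proof_of_work(data: bytes, difficulty: int) -> int:
    """Find a nonce such that SHA-256(data + nonce) starts with
    `difficulty` zero hex digits. Finding one requires brute force;
    checking a claimed nonce takes a single hash."""
    nonce = 0
    while True:
        digest = hashlib.sha256(data + str(nonce).encode()).hexdigest()
        if digest.startswith("0" * difficulty):
            return nonce
        nonce += 1

def verify(data: bytes, nonce: int, difficulty: int) -> bool:
    digest = hashlib.sha256(data + str(nonce).encode()).hexdigest()
    return digest.startswith("0" * difficulty)

# The 'vote' costs real computation to produce, but is nearly free to check.
nonce = proof_of_work(b"some block data", 4)
assert verify(b"some block data", nonce, 4)
```

Raising `difficulty` by one makes finding a nonce sixteen times more expensive on average, while verification stays just as cheap - that asymmetry is the whole point.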

To incentivize people to actually *do* this kind of computation - keep in mind, it's expensive! - cryptocurrencies are set up to *reward* those who do it, by essentially giving them first dibs on any newly minted currency. This is all fully automated based on that predefined set of rules, there are no manual decisions from some organization involved here.

Unfortunately, we're talking about *currencies*, and where there are currencies, there is money to be made. And many greedy people have jumped at the chance of doing so with Bitcoin. That's why there are entire datacenters filled with "Bitcoin miners" - computers that are built for just a single purpose, doing those computations, to get a claim on that newly minted currency.

And *that* is why Bitcoin uses so much energy. As long as the newly minted coins are worth *slightly* more than the cost of the computations, it's economically viable for these large mining organizations to keep building more and more 'miners' and consuming more and more energy to stake their claim. This is also why energy usage will always go up alongside the exchange rate; the more a Bitcoin is 'worth', the more energy miners are willing to put into obtaining one.

And that's a fundamental problem, one that simply cannot be solved, because it is so crucial to how Bitcoin works. **Bitcoin will forever continue consuming more energy** as the exchange rate rises, which is currently happening due to speculative bubbles, but which would happen if it gained serious real-world adoption as well. If everybody started using Bitcoin, it would essentially eat the world. There's no way around this.

Even renewable energy can't solve this; renewable energy still requires polluting manufacturing processes, it is often difficult to scale, and it is often more expensive than fossil fuels. So in practice, "mining Bitcoins on renewable energy" - insofar that happens *at all* - means that all the renewable energy you are now using could not be distributed to factories or households, and they have to continue running on non-renewable energy instead, so you're just shuffling chairs! And because of the endless growth of Bitcoin's energy consumption, it is pretty much guaranteed that those renewable energy resources won't even be *enough* in the end.

### So there's this proof-of-stake thing, right?

You'll often see 'proof of stake' mentioned as an alternative proof scheme in response to this. So what is that, anyway?

The exact implementations vary and can get very complex, but every proof-of-stake scheme is basically some variation of "instead of the scarce resource being energy, it's *the currency itself*". In other words: the more of the currency that you own, the more votes you have, the more control you have over how the network (and therefore the currency) works as a whole.
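As a toy sketch of that core idea - real proof-of-stake protocols are vastly more complex - stake-weighted "voting" boils down to something like this, with made-up names and balances:

```python
import hashlib

# Hypothetical balances, purely for illustration.
stakes = {"alice": 60, "bob": 30, "carol": 10}

def select_validator(stakes: dict, seed: str) -> str:
    """Deterministically pick who gets to 'vote' this round, with
    probability proportional to stake, given a shared random seed."""
    total = sum(stakes.values())
    # Map the seed to a number in [0, total)...
    point = int.from_bytes(hashlib.sha256(seed.encode()).digest(), "big") % total
    # ...and walk the stake ranges until we land in someone's share.
    for name in sorted(stakes):
        point -= stakes[name]
        if point < 0:
            return name
```

With these numbers, alice holds 60% of the currency and therefore gets picked roughly 60% of the time - which is exactly the "the wealthy control the network" problem described above.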

You can probably begin to see the problem here already: if the currency is controlled by those who have most of it, how is this any different from government-issued currency, if it's the wealthy controlling the financial system either way? And you'd be completely right. There *isn't* really a difference.

But what you might not realize, is that this applies for proof-of-work cryptocurrencies *too*. The frequent claim is that Bitcoin is decentralized and controlled by nobody, but that isn't really true. Because who can afford to invest the most in specialized mining hardware? Exactly, the wealthy. And in practice, almost the entire network is controlled by a small handful of large mining companies and 'mining pools'. Not very decentralized at all.

The same is true for basically every other proof scheme, such as Chia's "proof of space and time", where the scarce resource is just "free storage space". Wealthy people can afford to buy more empty hard drives and SSDs and gain an edge. Look at *any* cryptocurrency with *any* proof scheme and you will find the same problem, because it is a fundamental one - if power in your system is handed out based on ownership of a scarce resource of *some* sort, the wealthy will *always* have an edge, because they can afford to buy whatever it is.

In other words: it doesn't actually matter what the specific scarce resource is, and **it doesn't matter what the proof scheme is**! Power will always centralize in the hands of the wealthy, either those who already were wealthy, or those who have recently gotten wealthy with cryptocurrency through dubious means.

The only redeeming feature of proof-of-stake (and many other proof schemes) over proof-of-work is that it *does* indeed address the energy consumption problem - but that's little comfort when none of these options actually *work* in a practical sense anyway. This is ultimately a socioeconomic problem, not a technical one, and so you can't solve it with technology.

And that brings us to the next point...

### Yes, cryptocurrencies are effectively pyramid schemes

While Bitcoin was not *originally* designed to be a pyramid scheme, it is very much one now. Nearly every other cryptocurrency was designed to be one from the start.

The trick lies in encouraging people to buy a cryptocurrency. Whoever is telling you that *their* favourite cryptocurrency is the real deal, the solution to all problems, probably is holding quite a bit of that currency, and is waiting for it to appreciate in value so that they can 'cash out' and turn a profit. The way to make that value appreciation happen, is by trying to convince people **like you** to 'invest' or 'get in' on it. If you buy the cryptocurrency, that will drive up the price. If a *lot* of people buy the cryptocurrency, that will drive up the price *a lot*.

The more hype you can create for a cryptocurrency, the more profit potential there is in it, because more people will 'buy in' and drive up the price before you cash out. This is why there are flashy websites for cryptocurrencies promising the world and revolutionary technology, this is why people on Twitter follow you around incessantly spamming your replies with their favourite cryptocurrency, this is why people take out billboards to advertise the currency. It's a pump-and-dump stock.

This is also the reason why proponents of cryptocurrencies are always so mysterious about how it works, invoking jargon and telling you how much complicated work 'the team' has done on it. The goal is to make you believe that 'there must be something to it' for long enough that you will buy in and they can sell off. By the time you figure out it was all just smoke and mirrors, they're long gone with their profits.

And then the only choice to recoup your investment is for *you* to hype it up and try to replicate the rise in value. Like a pyramid scheme.

### The bottom line

**Cryptocurrency as we know it today, simply cannot work.** It promises to decentralize power, but proof schemes *necessarily* give an edge to the wealthy. Meanwhile there's every incentive for people to hype up worthless cryptocurrencies to make a quick buck, all the while disrupting supply chains (GPUs, CPUs, hard drives, ...), and boiling the earth through energy usage that far exceeds that of *all of Google*.

Maybe some day, a legitimate cryptocurrency *without* Bitcoin's flaws will come to exist. If it does, it will be some boring research paper out of an academic lab in three decades, not a flashy startup promising easy money or revolutionary new tech today. There are no useful cryptocurrencies today, and there will not be any at any time in the near future. The tech just doesn't work.

# Is my blockchain a blockchain?

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/e49d2bdc9dfec4adc9da8a8434fd029b](https://gist.github.com/joepie91/e49d2bdc9dfec4adc9da8a8434fd029b).</p>

Your blockchain must have *all* of the following properties:

- It's a merkle tree, or a construct with equivalent properties.
- There is no single point of trust or authority; nodes are operated by different parties.
- Multiple 'forks' of the blockchain may exist - that is, nodes may disagree on what the full sequence of blocks looks like.
- In the case of such a fork, there must exist a deterministic consensus algorithm of some sort to decide what the "real" blockchain looks like (ie. which fork is "correct").
- The consensus algorithm must be executable with *only* the information contained in the blockchain (or its forks), and no external input (eg. no decision-making from a centralized 'trust node').

If your blockchain is missing *any* of the above properties, **it is not a blockchain, it is just a ledger.**
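For reference, the merkle tree from the first property can be sketched in a few lines. This is a simplified illustration (Bitcoin, for example, uses double SHA-256 and a specific serialization format):

```python
import hashlib

def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def merkle_root(leaves: list[bytes]) -> bytes:
    """Compute the root hash of a merkle tree. Changing any leaf
    changes the root, so the root commits to all data below it."""
    if not leaves:
        raise ValueError("need at least one leaf")
    level = [sha256(leaf) for leaf in leaves]
    while len(level) > 1:
        if len(level) % 2 == 1:
            level.append(level[-1])  # duplicate the last node on odd levels
        level = [sha256(level[i] + level[i + 1])
                 for i in range(0, len(level), 2)]
    return level[0]
```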

# You don't need a blockchain.

<p class="callout info">This article was originally published at [https://gist.github.com/joepie91/a90e21e3d06e1ad924a1bfdfe3c16902](https://gist.github.com/joepie91/a90e21e3d06e1ad924a1bfdfe3c16902).</p>

If you're reading this, you probably suggested to somebody that a particular technical problem could be solved with a blockchain.

Blockchains aren't a desirable thing; they're defined by having [trustless consensus](https://gist.github.com/joepie91/e49d2bdc9dfec4adc9da8a8434fd029b), which necessarily has to involve some form of [costly signaling](http://cs.brown.edu/courses/csci2952-a/papers/perspective.pdf) to work; that's what prevents attacks like [sybil attacks](https://en.wikipedia.org/wiki/Sybil_attack).

In other words: blockchains *must* be expensive to operate in order to work effectively. That makes them a last-resort solution, for when you truly have no other options available for solving your problem; in almost every case you want a cheaper and less complex solution than a blockchain.

In particular, **if your usecase is commercial, then you do not need or want trustless consensus**. This especially includes usecases like supply chain tracking, ticketing, and so on. The whole *point* of a company is to centralize control; that's what allows a company to operate efficiently. Trustless consensus is the exact opposite of that.

Of course, you may still have a problem of trust, so let's look at some common solutions to common trust problems; solutions that are a better option than a blockchain.

- **If you just need to provide authenticity for a piece of data:** A cryptographic signature. There's plenty of options for this. Learn more about basic cryptographic concepts [here](https://paragonie.com/blog/2015/08/you-wouldnt-base64-a-password-cryptography-decoded).
- **If you need an immutable chain of data:** Something simple that uses a [merkle tree](https://en.wikipedia.org/wiki/Merkle_tree). A well-known example of this application is [Git](https://git-scm.com/), especially in combination with [signed commits](https://git-scm.com/book/en/v2/Git-Tools-Signing-Your-Work).
- **If that immutable chain of data needs to be added to by multiple parties (eg. companies) that mutually distrust each other:** A cryptographically signed, append-only, replicated log. [Chronicle](https://github.com/paragonie/chronicle) can do this, and a well-known public deployment of this type of technology is [Certificate Transparency](https://www.certificate-transparency.org/what-is-ct). There are probably other options. These are *not* blockchains.
- **If you need to verify that nobody has tampered with physical goods:** This is currently impossible, with or without a blockchain. Nobody has yet figured out a reliable way to feed information about the real-world into a digital system, without allowing the person entering it (or handling the sensors that do so) to tamper with that data.
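The append-only log from the third bullet is conceptually simple; here's a minimal, *unsigned* sketch of just the hash-chaining part (Chronicle and Certificate Transparency add cryptographic signatures, inclusion proofs and replication on top):

```python
import hashlib

class AppendOnlyLog:
    """Each entry records the hash of the previous entry, so
    tampering with any past entry breaks every hash after it."""

    def __init__(self):
        self.entries = []  # list of (prev_hash, data) tuples

    def append(self, data: bytes) -> None:
        prev_hash = self._entry_hash(-1) if self.entries else b"\x00" * 32
        self.entries.append((prev_hash, data))

    def _entry_hash(self, index: int) -> bytes:
        prev_hash, data = self.entries[index]
        return hashlib.sha256(prev_hash + data).digest()

    def verify(self) -> bool:
        """Recompute the chain and check every stored prev_hash."""
        expected = b"\x00" * 32
        for prev_hash, data in self.entries:
            if prev_hash != expected:
                return False
            expected = hashlib.sha256(prev_hash + data).digest()
        return True
```

Note that this on its own only detects tampering; it's the signatures and the replication to mutually distrusting parties that make the real systems trustworthy, and none of it requires trustless consensus.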

Some people may try to sell you one of the above things as a "blockchain". It's not, and they're lying to you. A blockchain is defined by its trustless consensus; all of the above schemes have existed for way longer than blockchains have, and solve much simpler problems. The above systems also don't provide full decentralization - and that is a *feature*, because decentralization is expensive.

If somebody talks to you about a "permissioned blockchain" or a "private blockchain", they are also feeding you bullshit. Those things do not actually exist, and they are just buzzwords to make older concepts sound like a blockchain, when they're really not. It's most likely just a replicated append-only log.

There's quite a few derivatives of blockchains, like "tangles" and whatnot. They are all functionally the same as a blockchain, and they suffer from the same tradeoffs. If you do not need a blockchain, then you *also* do not need any of the blockchain derivatives.

**In conclusion:** blockchains were an interesting solution to an extremely specific problem, and certainly valuable from a research standpoint. But you probably don't have that extremely specific problem, so you don't need and shouldn't want a blockchain. It'll just cost you crazy amounts of money, and you'll end up with something that either doesn't work, or something that has conceptually existed for 20 years and that you could've just grabbed off GitHub yourself.

---

### Additions

I'm going to add some common claims here over time, and address them.

#### "But it's useful as a platform to build upon!"

One of the most important properties of a platform is that it must be *cost-efficient*, or at least as cost-efficient as the requirements allow. When you build on an unnecessarily expensive foundation, you can never build anything competitive - whether commercial or otherwise.

Like all decentralized systems, blockchains fail this test for usecases that do not benefit from being decentralized, because decentralized systems are *inherently* more expensive than centralized systems; the lack of a trusted party means that work needs to be duplicated for both availability and verification purposes. It is a flat-out impossibility to do *less* work in an optimal decentralized system than in an equivalent optimal centralized system.

*Unlike* most decentralized systems, blockchains add an extra cost factor: costly signaling, as described above. For a blockchain to be *resiliently decentralized*, it *must* introduce some sort of significant participation cost. For proof-of-work, that cost is in the energy and hardware required, but any tangible participation cost will work. Forms of proof-of-stake are *not* resiliently decentralized; the cost factor can be bypassed by malicious adversaries in a number of ways, meaning that PoS-based systems aren't reliably decentralized.

In other words: due to blockchains being inherently expensive to operate, they only make sense as a platform for things *that actually need trustless consensus* - and that list pretty much ends at 'digital currency'. For everything else, it is an unnecessary expense and therefore a poor platform choice.


# Dependency management

# Transitive dependencies and the commons

In this article, I want to explain why I now only work with programming languages that allow conflicting transitive dependencies, and why this matters for the purpose of building a commons for software.

### Types of dependency structures

There are a lot of considerations in designing dependency systems, but there are two axes I've found to be particularly relevant to the topic of a software commons: nested vs. flat dependencies, and system-global vs. project-local dependencies.

#### Nested vs. flat dependencies

There are roughly two ways to handle transitive dependencies - that is, dependencies of your dependencies:

1. Either you make the whole dependency set a 'flat' one, where every dependency is a top-level one, or
2. You represent the dependency structure as a *tree*, where each dependency in your dependency set has *its own, isolated* dependency set internally.

I'm talking about the conceptual structure here, so it doesn't actually matter how these dependencies are stored on disk, but I'll illustrate the two forms below.

This is what a nested dependency tree in some project might look like:

<div drawio-diagram="25"><img src="https://wiki.slightly.tech/uploads/images/drawio/2024-12/59c5Hgg4jPx4Njzp-drawing-3-1734290302.png" alt=""/></div>

And this would be the *equivalent* tree in a flat dependency structure:

<div drawio-diagram="26"><img src="https://wiki.slightly.tech/uploads/images/drawio/2024-12/J2xZ6ptE8ZNTiyML-drawing-3-1734290392.png" alt=""/></div>

This also immediately shows the primary limitation of a flat dependency set: you cannot have conflicting versions of a transitive dependency in your project! This is what most of the process of "dependency solving" is about - of all the theoretically possible versions of every dependency in your project, find the set that actually works together, ie. where every dependency matches the version constraints for *all* of its occurrences in the project.
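
To make that solving process concrete, here is a hypothetical sketch (not any real package manager's algorithm) of what "dependency solving" means in a flat model: every package gets exactly one version, and that version must be acceptable to *all* of its dependents. Version constraints are simplified here to explicit lists of acceptable versions, and the package names are purely illustrative:

```javascript
// Hypothetical sketch of flat dependency solving (not npm's real algorithm):
// every package gets exactly one version, which must satisfy the constraints
// of *all* dependents at once.
function solveFlat(constraints) {
  // constraints: { packageName: [acceptableVersions, acceptableVersions, ...] }
  // where each inner array is the set of versions one dependent will accept.
  const solution = {};
  for (const [pkg, sets] of Object.entries(constraints)) {
    // Intersect all dependents' acceptable-version sets.
    const candidates = sets.reduce((acc, set) => acc.filter((v) => set.includes(v)));
    if (candidates.length === 0) {
      throw new Error(`dependency conflict on ${pkg}`);
    }
    solution[pkg] = candidates[0];
  }
  return solution;
}

// Two dependents agree on "1.3.0", so this solves:
solveFlat({ "left-pad": [["1.2.0", "1.3.0"], ["1.3.0", "2.0.0"]] });
// → { "left-pad": "1.3.0" }

// But if one dependent only accepts "2.0.0", a flat model has no answer:
// solveFlat({ "left-pad": [["1.2.0", "1.3.0"], ["2.0.0"]] }) throws,
// whereas a nested model would simply install both versions side by side.
```

The second case is the crucial one: the conflict is not a property of the packages themselves, only of the requirement that the dependency set be flat.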

#### Project-local vs. system-global dependencies

Another important, and related, design decision in a dependency system is whether dependencies are *isolated to the project*, or whether you have a single dependency set that's used system-wide. This is somewhat self-explanatory; if your dependencies are *project-local* then that means that they are stored within the project, but if they are *system-global* then there's a system-wide dependency set of some sort, and so the project gets its dependencies from the environment.

#### Some examples

Here are some examples of different combinations of these properties, and where you might find them:

- **System-global, flat:** Python (without virtual environment tools). There is a single system-wide collection of Python packages that every piece of Python software on the system uses. Can be turned into "Project-local, flat" using `virtualenv`. Another example would be C-style libraries.
- **Project-local, flat:** PHP, with Composer. Packages are installed to a `vendor` directory in your project, but only one version of a given dependency can be installed at a time.
- **System-global, nested:** Nix. There is a system-wide package store called the "Nix store", where every unique variant of a package is stored under a deterministic hash, and every dependency within a piece of software is referenced by hash from that store explicitly.
- **Project-local, nested:** Node.js. Each project has its own `node_modules` folder, and each dependency has *its* own `node_modules` folder containing its transitive dependencies, and so on.

### So why does any of this actually matter?

It might seem like all of this is just an implementation detail, and it's the problem of the package manager developers to deal with. But the choice of dependency model actually has a big impact on how people use the dependency system.

#### The cost of conflict

The problem at the center of all of this, is that *dependency conflicts are not free*. Every time you run into a dependency conflict, you have to stop what you are doing and resolve it. Resolving it may require anything from a small change to a complete architectural overhaul of your code, like when a new version of a critical dependency introduces a different design paradigm.

Now you might think "huh, but I rarely run into that", and that is likely correct - but it's not because the problem doesn't happen. What tends to happen in mature language ecosystems, is that the whole ecosystem centers around a handful of large frameworks over time, where the maintainers do all this work of resolving conflicts preventatively; they coordinate with maintainers of other dependencies, for example, to make sure that these conflicts do not occur.

This has a large maintenance cost to maintainers, and indirectly also a cost to you - it means that that time is not spent on, for example, nice new features or usability improvements in the tools that you use. The cost is still there, it's just very difficult to see if you are not a maintainer of a large framework.

#### Frameworks and libraries

This also touches on another consequence of working with conflict-prone dependency systems: they incentivize centralization of the ecosystem around a handful of major *frameworks*, that are usually quite opinionated about how to use them. In a vacuum, small and single-responsibility libraries would be [the optimal structure](https://wiki.slightly.tech/books/projects/page/why-are-there-so-many-packages "Why are there so many packages?") of an ecosystem, but that is simply not a sustainable model when your transitive dependencies can conflict; every dependency you add would superlinearly increase the chance of running into a conflict.

These frameworks are usually acceptable if you work on common systems that solve common problems; many people before you will have built similar things, and so the framework will likely have been designed to account for them. But it's a deadly barrier for unconventional or innovative projects, which do not fit into that mold; they are severely disadvantaged because in a framework-heavy ecosystem, every package comes with a large set of assumptions about what you'll be doing with it, and those are usually not going to be the right ones - leaving you to either not use packages at all, or spend half your time working around them.

#### Consequences for the commons

A more abstract way in which this problem occurs, is in its impact to the commons. The idea of a 'software commons' is simple; a large, public, shared, freely accessible and usable collection of software that anyone can build upon according to their needs and contribute to according to their ability, resulting in a low-friction way to collaborate on software at a large scale. Some of the idealized consequences of such a commons would be that every problem only needs to be solved exactly once, and it will forever be reliably solved for everyone, and we can all collectively move on to solving other problems.

This is a laudable goal, but it too is harmed by conflict-prone dependency systems. For this goal to be achievable, there must be some sort of distribution format for 'software' that is universally usable, assumption-free, and isolated from the rest of the environment, so that it is guaranteed to fit into any project that has the problem it is designed to solve. But a flat or even system-global dependency model cannot do that - in such a model, it is possible for one piece of software to make it impossible to use another; after all, that is exactly what a dependency conflict is.

In other words, to achieve a true software commons on a *technical* level (the social requirements are for another article), we need a nested, project-local dependency mechanism - or at least a mechanism that can approximate or simulate those properties in some way.

### So why are dependency systems so conflict-prone?

So given all of that, the answer would seem obvious, right? Just build nested, project-local dependency systems! And that does indeed solve these issues, but it brings some problems of its own.

#### Duplication

One of the most obvious problems, but also one of the easiest to solve, is that of duplication. If there are two uses of the same dependency in different parts of the dependency tree, you ideally want those to use the same copy to save space and resources, and indeed this is exactly the typical justification for a flat dependency space. This also applies to compilation; you'd usually want to avoid compiling more than one copy of the same library.

But there is a better way, and it is implemented today by systems like npm: a nested dependency tree which opportunistically moves dependencies to the top level when they are conflict-free. This way, they are only stored in a nested structure *on disk* when it is necessary to preserve the guarantees of a conflict-free dependency system, ie. when otherwise a dependency conflict would occur. This could be considered a hybrid form between nested and flattened dependencies, and is pretty close to an optimal representation.
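
That opportunistic flattening can be sketched as follows (a hypothetical illustration - npm's real hoisting is considerably more involved): given an already-resolved tree, move each dependency to the top level unless its name is already claimed there by a different version:

```javascript
// Hypothetical sketch of dependency hoisting (not npm's actual
// implementation): each (name, version) moves to the top level if that
// name is unclaimed or claimed by the same version; otherwise it stays
// nested under its dependent.
function hoist(tree) {
  const top = {}; // name -> version installed at the top level
  const nested = {}; // dependentName -> { depName: version } kept nested
  for (const [dependent, deps] of Object.entries(tree)) {
    for (const [name, version] of Object.entries(deps)) {
      if (!(name in top)) {
        top[name] = version; // first claim wins the top level
      } else if (top[name] !== version) {
        (nested[dependent] ??= {})[name] = version; // conflict: stays nested
      }
    }
  }
  return { top, nested };
}

hoist({
  app: { "left-pad": "1.3.0", request: "2.88.0" },
  request: { qs: "6.5.2" },
  "some-other-lib": { qs: "6.9.0" }, // conflicts with request's qs
});
// → top: { "left-pad": "1.3.0", request: "2.88.0", qs: "6.5.2" }
//   nested: { "some-other-lib": { qs: "6.9.0" } }
```

Everything that *can* be shared is shared, and the nested structure only appears where it is actually needed to avoid a conflict.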

The duplication problem exists in another form, and its solution is *the* optimal representation: duplication between projects. Two pieces of independent end-user software might use the same version of the same dependency, and you would probably want to reuse a single copy for all the same reasons as above. This is typically used as a justification for system-global dependency systems.

But here again, there is a better option, and this time it is truly an optimal representation: a single shared system-global store of packages, identified by some sort of unique identifier, with the software pointing to specific copies within that store. This optimally deduplicates everything, but still allows conflicting implementations to exist. This exists today in Nix (where each store entry is hashed and referenced by path from the dependent) and pnpm (an alternative Node.js package manager where the store is keyed by *version* and symlinks and hardlinks are used to access it in an npm-compatible manner).

#### Nominal types

Unfortunately, there is also a more difficult problem - it affects only a subset of languages, and explains why Node.js *does* have nested dependencies while a lot of other new systems do not. That problem is the nominal typing problem.

If you have a system with nominal typing, then that means that types are not identified by *what they are shaped like* (as in structural typing), but *by what they are called*, or more accurately by their *identity*. In a typical nominal typing system, if you define the same thing under the same name twice, but in different files, they are different types.

This poses an obvious problem for a nested dependency system: if you can have two different copies of a dependency in your tree, that means you can also have two different types that are *supposed* to be identical! This would cause a lot of issues - for example, say that a value in a dependency is generated by a transitive dependency, and consumed by a different dependency that uses a different version of that same transitive dependency... the value generated by one copy would be rejected by the other, for not being the same type.

This is what can happen, for example, in Rust - Cargo will nominally let you have conflicting versions, but as soon as you try to exchange values between those copies in your code, you'll encounter a type mismatch.
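
A flavour of the same identity problem can even be demonstrated in Javascript, despite it mostly avoiding the issue - `instanceof` compares by class *identity*, so two loaded copies of the "same" class (as you'd get from two nested copies of one dependency) do not match:

```javascript
// Simulate loading the same module twice, as would happen with two nested
// copies of one dependency in different node_modules directories:
function loadModuleCopy() {
  class Token {} // same source code each time, but a new class identity
  return { Token, make: () => new Token() };
}

const copyA = loadModuleCopy();
const copyB = loadModuleCopy();

const value = copyA.make();
console.log(value instanceof copyA.Token); // true
console.log(value instanceof copyB.Token); // false: same code, different identity
```

In practice Javascript code rarely relies on such identity checks across package boundaries, which is part of why nested dependencies work there; a nominally typed language performs the equivalent of this check on *every* value.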

There are some theoretical language-level solutions to this problem, for example in the form of *type adapters* - a specification of how one copy of a type may be converted into another copy of that type. But this is a non-trivial thing to account for in a language design, and to date I have not seen any major languages with such a mechanism. This means that nominally typed languages are, generally, stuck with flat dependencies.

(If you're wondering how this problem is overcome *without* nominal typing: the answer is that you're mostly just relying on the internal structure of types not changing in a breaking way between versions, or at least not without also changing the attribute names or, in a structurally typed system, the internal types. That *sounds* unreliable, but in practice it is very rare to run into situations where this goes wrong, to the point that it's barely worth worrying about.)

But even if this problem were overcome, there's another one.

#### Backwards compatibility

Dependencies are, almost always, something that is deeply integrated into a language. Whether through the design of the import syntax, or the compiler's lookup rules, or anything else, there's usually *something* in the design of a language that severely constrains the possibilities for package management. Nested dependencies can work for Node.js because CommonJS accounted for the needs of a nested dependency system from the start, and it is virtually impossible to retrofit it into most existing systems.

For the same reason that a software commons is a possible concept, dependencies are also subject to the network effect - they are a social endeavour, an exercise in interoperation and labour delegation, and that means that there is an immense ecosystem cost associated with breaking dependency interoperability - just ask anyone who has had to try fitting an ES Module into a CommonJS project, for example, or anyone who has gone through the Python 2 to 3 transition. This makes changing the dependency system a very unappealing move.

So in practice, a lot of languages simply aren't able to adopt a nested dependency system, because it would break everything they have today. For the most part, only new languages can adopt nested dependencies, and most new languages are going to be borrowing ideas from existing languages, which... have flat dependencies. Among other things, I'm hoping that this article might serve as an inspiration to choose differently.

### My personal view

Now, to get back to why I, personally, don't want to work with conflict-prone languages anymore: it has to do with the 'software commons' point mentioned earlier. I have many motivations behind the projects I work on, but one of them is the desire to build on a software commons using FOSS; to build reliable implementations for solving problems that are generically usable, ergonomic, and just as useful in 20 years (or more) as they are today.

I do not think that this is achievable in a conflict-prone language. Even with the best possible API design that needs no changes, you would still need to periodically update dependencies to make sure that your dependency's transitive dependencies remain compatible with widely-used frameworks and tools. This makes it impossible to write 'forever libraries' that are written once and then, eventually, after real-world testing and improvements, done forever. The maintenance cost alone would become unsustainable.

The problem is made worse by conflict-prone dependency systems' preference for monolithic frameworks, as those are necessarily opinionated and make assumptions about the usecase; and unlike a singular solution to a singular problem, such assumptions do not stand the test of time - needs change, and as such, so do common usecases. Therefore, 'forever libraries' cannot take the shape that a conflict-prone dependency system encourages.

In short, a conflict-prone dependency system simply throws up too many barriers to credibly and sustainably build a long-term software commons, and that means that whatever work I do in the context of such a system, does not contribute towards my actual goals. In practice this means that I am mostly stuck with Javascript today, and I am hoping to see more languages adopt a conflict-free dependency system in the future.

# Community governance

# How do you actually run a community project?

There's no shortage of open-source projects that present themselves as 'community projects', whether explicitly or otherwise. But what actually makes a successful community project, and why do so many projects fail to live up to this promise?

# How to de-escalate situations

<p class="callout info">I originally drafted this guide for the (public, semi-open) NixOS governance talks in 2024. It was written for participants in those governance discussions, as a de-escalation guide to steer conversation back to a constructive path. The recommendations in it, however, are more generally applicable to any sort of discussion, especially those in which decisions are to be made.</p>

Governance is a complicated topic that often creates conflicts; some of them small, some of them not so small. Moderators are tasked with ensuring that the governance Zulip remains a constructive space for people to talk these things out, but there is a lot that you can do yourself to keep the discussion constructive; or even as a third party intervening in someone else's escalating discussion.

This guide describes some techniques that can be used to prevent and de-escalate conflicts, and help to keep the governance discussion productive for everyone involved. We encourage you to use them!

This guide is based in part on [https://libera.chat/guides/catalyst](https://libera.chat/guides/catalyst), although several changes and additions have been made to better fit our specific situation.

### Assume good faith

The people who participate in these governance conversations are most likely here because they want the project governance to be improved, just like you. Try to assume that the other person is acting in good faith. Very few people genuinely seek to cause disruption, and when that is the case, it becomes a task for moderators to handle.

### Listen and ask

A lot of conflicts can be both prevented and de-escalated by simply asking more questions and listening more, instead of speaking. In general, prefer to ask people *why* they feel a certain way if that is unclear, rather than assuming their intentions - this will provide more space for concerns that would otherwise go overlooked, and avoid creating conflicts due to wrong assumptions.

Even when a conflict has already arisen, asking questions can still be effective to de-escalate; asking people why they are doing something will encourage them to reflect on their behaviour, and this can often lead to self-moderation. Most people do not want to be viewed as "the bad guy".

Likewise, in a conflict, prefer asking for someone's input rather than admonishing their behaviour; this centers the conversation on them and their thoughts, instead of on yourself. Even if you disagree, you are more likely to gain a useful insight this way, and calming down the situation helps everyone involved.

If you need to concretely ask someone to change their behaviour, prefer asking them as a "can you do this?" question, rather than outright demanding it - they will likely be more receptive to your request, and if there is a reason why they cannot, you can look for a solution to that together.

### Compromises and reconciliation

Many disagreements are not really fundamental: often there is just a miscommunication, or some mismatch in assumptions. When it seems like you cannot find agreement, try narrowing down exactly where the disagreement comes from, what the most precise difference between your views is. Often, this will inspire new solutions that work for everyone involved, and that reconcile your differences - eliminating the disagreement entirely.

If all else fails, it is often better to find a compromise that everyone can be reasonably happy with, than to leave one side of the conflict entirely unsatisfied. This should be a last resort; too many compromises can easily stack together into a sense of nothing ever being decided, or nothing being changeable. You should always prefer finding reconciliation instead, as described above. True compromise should be very rarely needed.

# Health

# Treating hair lice in difficult hair

### Theory

Hair lice are relatively harmless parasites that live in human head hair. Although they are typically not disease carriers, the itching they cause can be extremely frustrating.

Hair lice attach themselves by holding onto the hair, typically close to the skin of the head, which they need to do actively, ie. they need to be alive to do so. Dead hair lice will eventually fall out.

Typical recommendations revolve around using substances that are in some way deadly to lice; depending on the substance, by poisoning them, dehydrating them, or both. These substances are used along with a lice comb - a comb with very fine teeth (with just enough space between the teeth for strands of hair to pass through) - which essentially 'pulls' the weakened lice out of the hair.

Unfortunately this approach doesn't work for everybody; if you have long or particularly tangle-prone hair, it can be nearly impossible to get down to every bit of skin on your head. Given that the treatment needs to be repeated daily, and missing even one louse can make your efforts futile, this can make it impractical.

### Heat treatment

There are experimental heat treatment techniques to remove hair lice; these involve purpose-built devices for removing lice by dehydrating them, causing them to die and lose grip. Heat travels through tangled hair much more easily than a comb, and so can have a higher success rate. Unfortunately, you are unlikely to have such a specialized device at home, and it can be difficult to find someone to do it for you, especially if traveling is difficult or you do not have a lot of money to spend.

*Fortunately,* however, this process can be replicated with a simple hairdryer, as long as you are careful. Make sure your hairdryer is set to 'hot' mode, and your hair is dry. Then blow hot air through your hair, close to your head, for at least several minutes daily, for the usual treatment period of two weeks.

<p class="callout warning">You need to be very careful when using hot air this close to your head. It's okay for your head to start feeling hot, but as soon as you start getting a burning or scorched feeling on the top of your head, stop the treatment **immediately**, and keep more distance the next day. If you do not have a lot of hair, you may need to keep the hairdryer at quite some distance - thickness of the hair affects what the correct distance is for you.</p>

A method that I've found particularly effective is to blow air *upwards*; that is, instead of blowing onto your hair from the top, point the hairdryer upwards and blow it *under* your mop of hair, as it were - it should feel a bit weird, causing your hair to go in all directions. This maximizes airflow, as the air somewhat gets trapped under your hair, and the only way out is through; this minimizes the deflection of air you would get when blowing from the top down. Note that the upwards technique *can* worsen hair tangling.

You may also want to use a lice comb in the places where this is easily possible to do; it is not strictly required for the treatment to work, but it makes it easier to clear out the dead lice in one go, instead of having them fall out by themselves over time.

<p class="callout info">Make sure you continue for the full two weeks, with daily treatment; hair lice have a short breeding cycle, and this treatment only affects the living lice, not their eggs. This means that over the span of two weeks, you will need to gradually dehydrate every new generation of lice. Doing it daily without fail ensures that no generation has a chance to lay new eggs. If you miss a day, you may need to restart the two week timer.</p>

# Matrix

# State resolution attacks

These are some notes on various kinds of attacks that might be attempted against state resolution algorithms, such as the one in Matrix. Different state resolution algorithms are vulnerable to different kinds of attacks; a reliable state resolution algorithm should be vulnerable to none of them.

<p class="callout info">These notes are not complete. More details, graphs, etc. will be added at some later time.</p>

### Frontrunning attack

Detect an event that bans or demotes the user, then quickly craft a fake branch full of malicious events (eg. banning other users), but do not submit those events to any other homeserver yet, and then craft an event that parents both the fake branch and the event prior to the detected ban/demote, claiming that the fake branch came earlier and thereby bypassing the ban. Requires a malicious homeserver.

### Dead horse attack

Attach a crafted event to both a recent parent and an ancient parent, to try and pull in ancient state and confuse the current state; eg. an event from back when a user wasn't banned yet, to try and get the membership state to revert to 'joined' by pulling it into current state. So named because it involves "beating a dead horse".

### Piggybacking attack

A low-powerlevel user places an event in a DAG branch that a high-powerlevel user has also attempted to change state in, as the high-powerlevel state change might cause their branch to become prioritized (ie. sorted in front) in state resolution.

### Fir tree attack

Resource exhaustion attack; deliberately constantly creating side branches to trigger state resolution processes. Named after the shape of half a fir tree that it generates in the graph.

### Huge graph attack

Resource exhaustion attack; attach a crafted event to a wide range of other parent events throughout the history of the room, to pull as many sections of the event graph into state resolution as possible.

### Mirror attack

Takes advantage of non-deterministic state resolution algorithms to create a split-brain situation that breaks the room, by creating a fake branch containing the exact inverse operations of the real branch, and then resolving the two together; as there is no canonically 'correct' answer under these circumstances, the goal of the attack is to make different servers come to different conclusions.

# Protocols and formats

# Working with DBus

<p class="callout info">This article is a work in progress. It'll likely be expanded over time, but for now it's incomplete.</p>

### What is DBus?

DBus is a standardized 'message bus' protocol that is mainly used on Linux. It serves to let different applications on the same system talk to each other through a standardized format, with a standardized way of specifying the available API.

Additionally, and this is probably the most-used feature, it allows for different applications to 'claim' specific pre-defined ("well-known") namespaces, if they intend to provide the corresponding service. For example, there are many different services that can show desktop notifications to the user, and the user may be using any one of them depending on their desktop environment, but whichever one it is, it will always claim the standard `org.freedesktop.Notifications` name.

That way, applications that want to *show* notifications don't need to know which specific notification service is running on the system - they can just send them to whoever claimed that name and implements the corresponding API.

### How do you use DBus as a user?

As an end user, you don't really need to care about DBus. As long as a DBus daemon is running on your system (and this will be the case by default on almost every Linux distribution), applications using DBus should just work.

If you're curious, though, you can use a DBus introspection tool such as QDBusViewer or D-Spy to have a look at what sort of APIs the programs on your system provide. Just be careful not to send anything through it without researching it first - you can break things this way!

### How do you use DBus as a developer?

You'll need a DBus protocol client. There are roughly two options:

1. Bindings to libdbus for the language you are using, or
2. A client implementation that's written directly in the language you are using (eg. `dbus-next` in JS)

You could also write your own client, as DBus typically just works over a local socket, but note that the serialization format is a little unusual, so it'll take some time to implement it correctly. Using an existing implementation is usually a better idea.

Note that you use a DBus *client* even when you want to *provide* an API over DBus; the 'server' in this arrangement is the DBus daemon, not your application.

### How the protocol works

DBus implements a few different kinds of interaction mechanisms:

- **Properties:** These are (optionally read-only) values that can be read or written. They're usually used to *check* or *change* something.
- **Methods:** These are callable and can produce a result. They're usually used to *do* something.
- **Signals:** These are like events, and can be subscribed to. They're usually emitted when something *happens* on the other side.

All of these - properties, methods and signals - are addressable by pre-defined names. However, it takes a few steps to get there:

- First, you need to select a **bus name** - this is kind of like a process name (or, in the case of a "well-known" API, the standard name), although technically one process can present multiple bus names. Its components are delimited by dots.
- Then, on the resulting bus, you select an **object path** - essentially, this is the specific 'object' (or object *type*) within the process that you wish to access. Its components are delimited by slashes.
- Finally, on the selected object, you then select an **interface** - you can think of this as the 'service' that you wish to access. Custom DBus APIs often only implement a single interface, in addition to the standard DBus-specified interfaces for introspection (see below).

After these steps, you will end up with an interface that you can interact with - it has properties, methods, and/or signals. Don't worry too much about how exactly the hierarchy works here - the division between bus name, object path and interface can be (and in practice, is) implemented in many different ways depending on requirements, and if you merely wish to *use* a DBus API from some other application, you can simply specify whatever its documentation tells you for all of these values.

Some more information and context about this division can be found [here](https://pydbus.readthedocs.io/en/latest/dbusaddressing.html), though keep in mind that you'll often encounter exactly one possible value for bus name, object path and interface, for any given application that exposes an API over DBus, so it's not required reading.
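To make this concrete, here's a toy model (in JavaScript) of the three-step lookup. This is *not* a real DBus client - it just uses plain objects to show how a bus name, object path and interface narrow things down to a set of members. The MPRIS names are real; the member lists are abbreviated.

```javascript
// Toy model of DBus addressing - plain objects, not a real client.
const sessionBus = {
  // Step 1: bus name - which application (or well-known API) to talk to.
  "org.mpris.MediaPlayer2.vlc": {
    // Step 2: object path - which object within that application.
    "/org/mpris/MediaPlayer2": {
      // Step 3: interface - which 'service' on that object.
      "org.mpris.MediaPlayer2.Player": {
        methods: [ "Play", "Pause", "Next" ],
        properties: [ "PlaybackStatus", "Volume" ],
        signals: [ "Seeked" ]
      }
    }
  }
};

function resolve(bus, busName, objectPath, interfaceName) {
  // A real DBus client does this resolution through the daemon's socket;
  // here it's just nested property lookups.
  return bus[busName]?.[objectPath]?.[interfaceName] ?? null;
}

const player = resolve(
  sessionBus,
  "org.mpris.MediaPlayer2.vlc",
  "/org/mpris/MediaPlayer2",
  "org.mpris.MediaPlayer2.Player"
);
// `player` now represents the selected interface; in a real client, this is
// the point where you'd call its methods or read its properties.
```

With a real client like `dbus-next`, the shape is similar: you obtain a proxy object for a given bus name and object path, and then select an interface on that proxy.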

### Introspection

An additional feature of DBus is that it allows introspection of DBus APIs; that is, you can use the DBus protocol itself to interrogate an API provider about its available API surface, the argument types, and so on. The details of this are currently not covered here.

### Some well-known DBus APIs

- **[MPRIS](https://specifications.freedesktop.org/mpris-spec/latest/)** - The 'media player control' API.
- **[FreeDesktop Notifications](https://specifications.freedesktop.org/notification-spec/latest/)** - The standard API for displaying desktop notifications.

# Problems

Things I'm trying to work out.

# Subgraph sorting

We have a graph:

<div drawio-diagram="27"><img src="https://wiki.slightly.tech/uploads/images/drawio/2025-03/FkSDBifNQgi0znqd-drawing-3-1743028999.png" alt=""/></div>

We sort this graph topologically into a one-dimensional sequence:

**A, B, C, D**

The exact sorting order is determined by inspecting the contents of these nodes (not shown here), and doing some kind of unspecified complex comparison on those contents. As this is a topological sort, the comparison is essentially the secondary sorting criterion; the primary sorting criterion is whatever preserves the graph order of the nodes (that is, an ancestor always comes before the node that it is an ancestor of). Crucially, this means that nodes in *different* branches are compared to each other.

The resulting sorting order is stored in a database, in some sort of order representation. The exact representation is undefined; which representation would work best here is part of the problem being posed.

Now, the graph is expanded with a newly discovered side branch, introducing two new nodes, **E** and **F**:

<div drawio-diagram="28"><img src="https://wiki.slightly.tech/uploads/images/drawio/2025-03/P4O29v0jiBz5cfJs-drawing-3-1743029259.png" alt=""/></div>

The new node **E** now participates in the sorting alongside **B**, **C**, and **D** - we know that **E** must come after **A** and before **F**, because of the ancestor relationships, but we do not know how exactly its ordering position in the sequence relates to the other three nodes, without actually doing the comparison against them.

**The problem:** the existing order (A, B, C, D) must be updated in the database, such that **E** and **F** also become part of the ordered sequence. The constraints are:

- The process may not load A, B, C and D into memory all at once. Loading them into memory on-demand is acceptable, as long as the performance cost is not too high, and there is never more than one node loaded into memory at once. Assume a standard file-backed key/value store as the database.
- The process should avoid rewriting the entry in the database for all of the existing nodes A, B, C and D. Some rewriting is acceptable, but having to rewrite *every* other participating node constitutes a performance problem.
- The outcome must be **deterministically identical** to what the outcome would have been if the graph had been fully known upfront, and sorted topologically, all at once in memory.

You may choose any internal representation in the database, and any sorting mechanism, as long as it fits within the above constraints.
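For reference, here's a minimal sketch (in JavaScript) of the in-memory behaviour that any incremental solution must reproduce: Kahn's algorithm, where ties between 'ready' nodes are broken by comparing node contents. The graph shape (A→B→C→D with a side branch A→E→F) and the numeric 'contents' are assumptions for illustration - the real comparison would be the unspecified complex one.

```javascript
// A deterministic topological sort: Kahn's algorithm, breaking ties between
// "ready" nodes (all ancestors already emitted) with a content comparison.
// The graph shape and numeric contents below are assumptions for illustration.
function topoSort(nodes, edges, compare) {
  const children = new Map();
  const indegree = new Map();
  for (const name of Object.keys(nodes)) {
    children.set(name, []);
    indegree.set(name, 0);
  }
  for (const [ from, to ] of edges) {
    children.get(from).push(to);
    indegree.set(to, indegree.get(to) + 1);
  }

  // The frontier contains every node whose ancestors have all been emitted.
  const frontier = Object.keys(nodes).filter((name) => indegree.get(name) === 0);
  const order = [];

  while (frontier.length > 0) {
    // Secondary criterion: pick the 'smallest' ready node by its contents.
    frontier.sort((a, b) => compare(nodes[a], nodes[b]));
    const next = frontier.shift();
    order.push(next);

    for (const child of children.get(next)) {
      indegree.set(child, indegree.get(child) - 1);
      if (indegree.get(child) === 0) frontier.push(child);
    }
  }

  return order;
}

// A -> B -> C -> D, plus the side branch A -> E -> F.
const nodes = { A: 1, B: 2, C: 4, D: 5, E: 3, F: 6 };
const edges = [ ["A", "B"], ["B", "C"], ["C", "D"], ["A", "E"], ["E", "F"] ];
const order = topoSort(nodes, edges, (a, b) => a - b);
// With these contents, E sorts in between B and C: A, B, E, C, D, F.
```

Whatever database representation is chosen, reading it back must yield exactly this sequence, without ever having run this full in-memory sort.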

# Personal

# Things I will not debate anymore

This is a list of things that I am not willing to 'debate' anymore. I will happily explain them if something is unclear, but if you're planning to try and convince me otherwise, don't bother. All of these are topics that I've had many - often many hundreds - of discussions about, and no new arguments show up anymore; but endless repeats of the same fallacies and non-arguments sure do, and I am not willing to put any more time and energy into them.

This list may (and probably will) grow over time, as new grifts show up.

#### Generative "AI" is harmful and should not be tolerated

There is an excellent article [here](https://benui.ca/why-i-refuse-ai/), summarizing many of the reasons. Additionally, workers in poor countries [are routinely and structurally exploited](https://pivot-to-ai.com/2024/11/30/meet-the-underpaid-workers-in-nairobi-kenya-who-power-openai/) to even train these models to begin with. Also [this article](https://xeiaso.net/blog/2025/rolling-ladder-behind-us), which sets out in more detail how it affects how people work.

# Blender

# Working with the curve modifier

I've had a miserable time trying to work out the (grossly underdocumented) Curve modifier in Blender, so here's a bunch of notes that might be helpful.

The summary: you create a curve that specifies a path along which your mesh should be bent. The mesh will be deformed as needed to make that happen. The origin of the curve specifies the starting point of the newly-bent mesh, and you specify the deform axis; this axis can be understood as the X in *"the further along the X axis the vertex is, the further along the bending path it will be, away from the path's origin".*

To some degree you're going to have to experiment to work out *exactly* how to use this feature; I have not yet figured out the precise rules by which it operates, but this description should at least give you a decent idea of the result you *should* be getting.

## Problems

### How do I set the origin of the curve? The 3D cursor is annoying to deal with.

You can press ctrl+. (that's the period symbol) to go into origin editing mode for an object *while in Object mode*. You can then move the origin like you would move any vertex.

### I can't find my curve in the curve selection window for the modifier!

Maybe you created a mesh (eg. a circle) instead of a curve? The modifier really *only* works with curves, not with meshes that happen to be curve-shaped, and meshes will simply not show up as an option. If unsure, use a Bezier curve, but anything from the *Add -> Curve* submenu should work.

### My object gets rotated all of a sudden when I apply the modifier!

Turns out that the points on a curve have a 'tilt': essentially an orientation or rotation of sorts. You can use ctrl+T (in edit mode *for the curve*) to change this orientation, and that should orient your object correctly.

### My object is too long/short after applying the modifier! It should be exactly as long as the curve itself.

By default, the Curve modifier will imagine an infinite line segment after the curve points on each end, and so any bit of the object that doesn't "fit" into the curve (by some definition I do not quite understand) will get extended beyond the curve at the last known angle it encountered.

You can turn this off by going to the *Object properties* for the curve (**not** the modifier settings!) and, under the *Shape* section, enabling the *Stretch* option. This will fit the object into the curve's length exactly, stretching and/or compressing as necessary, ensuring that it ends exactly at the ends of the curve.

# 3D printing

# Gridfinity but with minimal plastic use and printing time

Gridfinity is great, but it's pretty wasteful of plastic, especially if you use the original bin and baseplate designs by Zack Freedman. There's many alternative designs available by now, but it's hard to find what you need, and I've been spending some time figuring out exactly *how* low I can get the plastic use and printing time of my Gridfinity setup, **without compromising on functionality**.

These are my findings. I'll update them as I discover new things.

### General notes

Use a strong PLA+. The good ones are much stronger than typical PLA, and you'll be able to get away with 2-layer perimeter walls (0.8mm) without any meaningful loss of robustness. I've been using GST3D but their quality control is unreliable, so if you get a good batch then it's *good* but if you get a bad batch, well, good luck.

You probably need 3-layer perimeter walls if you're using a standard PLA or weak PLA+, and the ultralight designs may not work for you. But give it a shot first!

### Baseplates

The standard design has magnet holes and thick edges. Great if you need the strength, or you need to insert magnets (or screw them down!), but not so great if you really just need some basic grids to glue or tape onto a surface, for example.

I've been primarily using these baseplates: [https://printables.com/model/596506-gridfinity-superlight-baseplates](https://printables.com/model/596506-gridfinity-superlight-baseplates)

However, I've [recently discovered](https://fedi.slightly.tech/@joepie91/statuses/01K5KM9R30B4W23XASYWZP3PJ5) that you can just delete the top 1mm of the 'spec-compliant' baseplate and *it will still work fine*. This is a big deal, because it saves **a quarter of the filament**, and on my 1 hour and 43 minute print, it cut 20 minutes off the print time. I don't know if it's technically spec-compliant.

### The bins

There are a lot of problems with the standard Gridfinity bins. The magnet holes are the obvious one, and the wall thickness could be a lot less, but did you know that the 'feet' can be hollowed out? And that the standard scoop radius is *way* too big, and you can reduce it significantly, using way less plastic *and* making it easier to remove parts from a bin?

I've been using this OpenSCAD generator to generate ultralight bins: [https://makerworld.com/en/models/513771-gridfinity-ultra-light-bin-generator](https://makerworld.com/en/models/513771-gridfinity-ultra-light-bin-generator) (if you don't have an account, you can also grab it from here: [UltraLightGridfinityBins.scad](https://wiki.slightly.tech/attachments/1) - it's licensed under CC BY-NC-SA).

There's also pregenerated STLs for those ultralight bins at [https://www.printables.com/model/627719-gridfinity-ultra-light-bins-plain-edition](https://www.printables.com/model/627719-gridfinity-ultra-light-bins-plain-edition), but crucially, **those don't have a finger scoop**, and it's not documented anywhere that scoops are even supported by the underlying generator! So I really strongly recommend using the generator directly in OpenSCAD. I've found a scoop radius of 12 to be really convenient, and it doesn't add *that* much plastic to the print, all things considered. This is different from the default of 15.

You can use the ultralight base with the scoop, it works fine. It doesn't *look* nice, necessarily, but I've not found it to impair the functionality of the bin at all.

If you're wondering whether this is all really worth it: [in my testing](https://fedi.slightly.tech/@joepie91/statuses/01K54ME0K3S2210R6R8N78HF8C), I found a reduction from 40 grams of plastic to 26.5 grams for a 1x2 three-compartment scoop bin, and the lighter one works better!

#### Custom bin designs

If you're making custom bins in OnShape, you may have been using [this OnShape project](https://www.reddit.com/r/gridfinity/comments/wd6f1o/ported_to_onshape/) as a reference. It's not exposed in the configuration settings on the left, but if you go to the variable table in the tiny menu on the right edge, you'll find a `WALL_THICKNESS` variable that you can set to 0.8 (or whatever else constitutes two walls with your slicer settings).

Under Parts -> Unit -> Start, you can delete all of the hole extrusions and chamfers, and this will make the magnet holes go away. There doesn't seem to be an option for hollow feet, though.

# Linux

# Fixing crackling/skipping/glitching issues with Pipewire and Wireplumber

For the last two NixOS releases, I've been having audio issues with Pipewire, using Wireplumber as the session manager. There was an imperfect correlation with high system load, but particularly starting and quitting games often caused the problem to get worse until the next restart (of Wireplumber and sometimes the whole system).

The issues manifested as (variously) crackling, skipping (eg. in music), and sometimes very loud and high-pitched glitched noises, making it very irritating to listen to music or watch a video.

Anyway, turns out that the problem seems to be in `pipewire-pulse`, which according to some random comments on the internet is notorious for these sorts of issues. I don't know if that's true, but it certainly seemed to be responsible here. Disabling it (in NixOS, setting `services.pipewire.pulse.enable` to `false` instead of `true`) seems to have solved the issue, but more testing will be necessary to see if it's a permanent fix. I don't yet know whether it breaks any applications due to PulseAudio support now not being there anymore. So far everything works.

And yeah, almost every thread tells you to change the quantum values. And sometimes people claim that the defaults are "bad". But I have my doubts about that, and it sounds an awful lot to me like treating symptoms based on observations and superstitions (with possibly unwanted side-effects like higher audio latency), and it certainly doesn't solve the *cause* of the problem. Yep, changing the quantum values made the problem less prominent for me too. But it never *solved* it and that's probably because it was just masking the symptoms. I'd suggest *not* changing the quantum values and instead figuring out why the issue is happening in the first place.

Also, as a more general note: "the defaults are wrong" is unlikely for just about *any* problem, especially for widely-used software. The developers are probably aware of the issue you're running into, and there's probably a reason that the defaults haven't been changed, even if you're not personally aware of those reasons. Spend that extra bit of time to understand why the defaults are what they are, and whether maybe you're just misunderstanding the problem. The toxic Linux culture of "constantly bashing on developers based on a half-understanding" is tiring and I want to see less of it. You're not helping anyone with it, all you're doing is making it harder for people to find the solution because they have to wade through a mountain of Bad Opinions first.

Like how I ended up spending months trying to track down why my audio was crackling because everyone was just blathering on about quantum values.

## Update: more issues

As I've been testing and debugging these issues more and more, I've been running into weirder and weirder factors that could be contributing to it (and I still have no idea which ones contributed to which degree). A whirlwind tour:

- There's a bug in the fTPM on many Ryzen CPUs. It's fixed with a new firmware from AMD, which your motherboard vendor *should* have shipped a while ago. Make sure you're updated. If your motherboard vendor *hasn't* issued an update, consider disabling the fTPM in your BIOS/UEFI (note that this will break anything that uses it, eg. Bitlocker on a Windows install).
- There's a bug in certain versions of the realtime portal for xdg-desktop-portal, causing Pipewire to fail to acquire realtime privileges. In order, Pipewire tries rlimits (system configuration), the realtime portal, and then RTKit (a separate daemon). You may need to update some stuff or, if unable to, disable the realtime portal usage in Pipewire to force a failover to RTKit, depending on your circumstances.
- In my case, one *certain* source of issues was a bizarre one: the "GPU usage" meter in htop. Having it enabled on my (AMD CPU, AMD GPU) system causes the audio stutters to occur whenever htop is running. Disable the meter and it goes away. I have *no idea* at what layer of the system this is happening, and I'm not sure I want to know.

By the way, everybody tells you to use pw-top. Consider that its ERR column can be very misleading about where an issue actually occurred. I'd recommend setting log level 3 for Pipewire globally instead (while debugging), and looking at the logs, which are much more informative.

I think I've gotten rid of all the sources now. By this point I'm highly skeptical that I've found the actual root cause, and the real problem is probably somewhere deeper in the system that I can't easily debug. But for now, I am free from the tyranny of stuttering music playback!

# Console modding

# Playstation 2

## Troubleshooting

### I got a HDMI adapter, and I don't get a signal, just a black screen

In the PS2's own console settings, switch the *Component Video Out* setting from RGB to Y Cb/Pb Cr/Pr (the other option). Internally, these adapters use the component (YPbPr) signal to get an HD signal out of the console, and the RGB output is not capable of that. Once you change this setting, assuming you're currently connected with composite cables (yellow/white/red) to get into the settings menu, your TV will turn black immediately; this is fine - once you restart the console with the HDMI adapter connected, you'll get signal over HDMI.

### When running OPL (Open-PS2-Loader), the screen keeps flickering or turning on and off

In the PS2's own console settings, turn off the *Digital Out (Optical)* setting. This seems to happen specifically when using component input (eg. when using an HDMI adapter for the PS2, which internally uses the component signal). The problem should go away. This won't work if you're actually *using* the optical output, of course.

### OPL is not detecting anything on my SD card (using an MX4SIO adapter)

#### Step 1: Check that your version of OPL is new enough

OPL has only supported exFAT filesystems since beta 1.2.0, and specifically since build 1880. Even if you have 1.2.0, your build may be too old. If in doubt, download whatever the latest "pre-release" version is at [https://github.com/ps2homebrew/Open-PS2-Loader/releases](https://github.com/ps2homebrew/Open-PS2-Loader/releases) and use that; then it'll *definitely* be new enough.

If you're using 1.1.0 (the latest stable version at the time of writing), then it definitely won't work; that version doesn't support MX4SIO adapters. Switch to the latest 1.2.0 pre-release version in that case too.

#### Step 2: Make sure that it's actually enabled

OPL supports many different sources, and the MX4SIO adapter is grouped under the "BDM" (Block Device Manager), but so are some other things like USB. If after step 1 it's still not working, you may simply be looking at the wrong source; make sure you check *every* 'subtab' under the BDM tab (use the direction keys), and enable the device if necessary. And of course check that MX4SIO support is enabled in OPL's BDM settings (and those settings have been saved) to begin with.

#### Step 3: Format your microSD card differently

If it still doesn't work, the problem may be your exFAT cluster size.

It seems that OPL, at least with some cards, requires a 32KB cluster size for exFAT-formatted cards. If you have a card bigger than 32GB, the default cluster size is probably bigger; and your filesystem is almost certainly exFAT. Reformat it as exFAT with a 32KB cluster size and it should work. On Windows, Rufus can apparently be used for this.

<p class="callout danger">Reformatting will delete any data currently on the disk.</p>

<p class="callout warning">Be careful that you set the **cluster size**, and *not* the sector size. Changing the sector size can brick your card in some cases, and it will not fix the issue.</p>

# Household

# Laundry detergent, washing machines, and foam/suds

Some notes on washing machines and detergents that I've collected, in no particular order.

Foam or suds are actually a *bad* thing in washing machines, not a good thing. The job of the detergent is to get the dirt out of the fibers of your clothes, but in the end it's the water that actually transports the dirt away from your laundry. Suds prevent the dirty water from evacuating your laundry, by creating a sort of 'protective barrier' around it - the dirt may have gotten loose from the fibers, but it's still trapped inside your laundry. The right amount of suds is "few to none, but there's still detergent in the water and the water looks dirty". All the dirt that's in the water is not in your clothes!

There are roughly two types of laundry detergent:

1. Machine detergent - in the Netherlands (and probably elsewhere in Europe) this is just the standard type of laundry detergent. In the US, this is labelled "high-efficiency" and specifically sold as being suitable for front-loading washing machines. Machine detergent produces minimal suds, because most of the agitation work is done by the machine, and you'd just get excessive suds.
2. Hand-washing detergent - in the Netherlands this is specifically labelled as such. It produces more suds, and (as I understand it) the agitation is provided by a combination of your manual actions, and the chemical processes that produce the suds. This detergent foams a lot more, and *should not* be used in a washing machine.

(Sidenote: this differentiation between "chemical cleaning" and "mechanical cleaning" is something you'll find in *all sorts* of cleaning situations - for example, a dishwasher cleans almost entirely chemically, which is how it can get away with such weak jets. And it even applies when cleaning things by hand; sometimes things require less scrubbing because the cleaning agent is doing the work! It's interesting to pay attention to different cleaning procedures and ask yourself whether it's mechanical cleaning, chemical cleaning, or both - and maybe whether that answer should be different for what you're trying to clean.)

If your washing machine seems to spontaneously interrupt its spin/centrifuge cycle, suddenly braking from a high spin speed, then chances are that it detected excessive water in the drum that it couldn't drain in time. There are a few reasons this can happen (including a problem with the pump), but one of the possible reasons is that the outlet got blocked by excessive suds or foam. Some machines will recirculate the foamy water for a while and occasionally add fresh water, in an attempt to remove the suds. But this can take a long time, and doesn't always succeed - in which case the machine will leave your laundry wet because it couldn't finish its spin cycle.

A related problem is 'foam lock', where the drum encounters excessive friction against its casing, caused by foam buildup sort of "sticking the two together" like a suction cup. The drum motor will detect this and abort its spin cycle, to prevent damage to the motor from overloading it. The machine may or may not activate a foam removal procedure in this case. This problem is more prominent with high-capacity and especially-energy-efficient machines, as these tend to have less 'buffer space' between the drum and the casing.

Excessive foam is often caused by using too much detergent, but counterintuitively it can also be caused by too *little* detergent. This apparently happens because the detergent also contains anti-foaming agents, and with too small an amount of detergent, there's not enough of them in the water to effectively prevent suds. It seems that "enough" is not just a matter of ratio (perhaps it's about the surface area of the water pool?); even though the ratio of active detergent to anti-foaming agents stays the same, the total amount of detergent powder can still make a difference.

The right amount of detergent can vary strongly by machine, even for the same laundry load with the same amount of water, so experiment and see what works with your machine. The differences can be bigger than you expect! If your drain pump has clogged with suds, sometimes evacuating water through an emergency drain hose or (if your machine doesn't have one) the filter opening can help to clear the blockage, and allow the machine to drain properly again. Keep in mind that a lot more water can come out of this than you expect and it's low to the ground, so be prepared to catch all of it and to quickly close the outlet again if you can't!

Run a self-cleaning cycle on your machine every month. Make sure the drum is empty, toss an all-in-one *dishwasher* tablet (note: *not* tested with liquid packets!) into the drum, and run it on the 90°C cleaning cycle - pretty much every machine has one, but sometimes it's hidden behind a button combination or sequence. This cleaning cycle not only deals with biofilms and mold, but also clears out detergent residue buildup, which can be a problem especially if you typically use liquid detergent. The detergent tray itself should also be cleaned regularly; it'll likely have something you can hold pressed to 'unlock' it from the machine and slide it out in its entirety.

Powder and liquid detergents both have their own upsides and downsides, also for the machine, so it's a good idea to alternate between the two. Your detergent tray should have some sort of divider wall or flippable wall to ensure that the liquid detergent stays in the tray until the (main) washing cycle actually starts - always use it for liquid detergent and *never* use it with powder detergent! If you can only use one of the two (eg. for logistical reasons), use powder detergent and wash the detergent tray more regularly.