This post is part of my developer-friendly security series. You can find all other posts here.

XSS is a vulnerability that allows attackers to run arbitrary JavaScript code in applications they shouldn’t be able to control. This can lead to complete account compromises for every victim that follows a malicious link or visits a compromised page.

There are two major families of XSS: server side and client side. Today we’ll discover how to better protect against server-side XSS, how to reduce the risk of client-side XSS (with some novel technologies) and how to mitigate both of them with some defense-in-depth mechanisms. In the end there will be a recap (with code!) to put together the most important bits you shouldn’t forget.

Server-Side XSS

Most programs and web applications need to store and/or send back some data to the users. Modern web applications have hundreds of inputs and hundreds of outputs. As you might imagine sometimes developers get some escaping wrong and that is where things start going bad.

Let’s make an example, suppose you are using “text/template” to generate pages:

<div>
Hello I'm {{.Username}} and welcome to my shop!
</div>

If the username is Rob the output of the page will be

<div>
Hello I'm Rob and welcome to my shop!
</div>

This works as intended. But what happens if the username is <script>evilCode()</script>?

When the browser receives the code, it can’t figure out that the string above should be treated as text, so it executes it:

<div>
Hello I'm <script>evilCode()</script> and welcome to my shop!
</div>

In this example the evilCode() function is invoked.

An attack scenario would be that an evil user signs up in this application, enters the malicious code as their name, and sends a link to their profile to the victim. When the victim visits the profile the name of the attacker is improperly interpolated in the page and evil code is executed. This will cause the application to perform actions in the name of the victim without them knowing.

The example above is an instance of stored XSS as it persists in the database of the application. Some other server-side XSS might only trigger when interpolating data from the request into the response, and those are called reflected XSS. One example in Go would be:

func vulnerableHandler(w http.ResponseWriter, r *http.Request) {
  r.ParseForm()
  tok := r.FormValue("token")
  if !isValid(tok) {
    fmt.Fprintf(w, "Invalid token: %q", tok)
  }
  //...
}

This would reflect part of the input in the output without escaping, which causes XSS (you should never use fmt to write to a HTTP response).

In both reflected and stored XSS the culprit is a lack of proper escaping. The way you you can think about this is that you should only be able to write trusted content to the page. Everything else would need to be “promoted” to be trusted. You can imagine text and HTML to be two different types:

TrustedHtml + Trusted HTML → TrustedHtml
TrustedHtml + escapeHtml(text) → TrustedHtml
TrustedHtml + sanitizeHtml(untrustedHTML) → TrustedHtml
TrustedHtml + text → ERROR

If you are a developer like me you don’t want to care about all the possible escaping contexts and steps you have to go through, you just want things to work. Stuff should be secure by default and not require the developers to care about it. Libraries should be close to impossible to use wrong.

A solution

If you are working with Go you are lucky. The html/template package performs contextual auto-escaping. This means that when your templates are parsed the library detects in which context you are putting strings in and it will pick a chain of escaping functions to properly encode it.

If you are interpolating, for example, in this context:

<script>
  var a = "{{.Data}}";
</script>

This will automatically be encoded in a way that an attacker could not break. This means that 2 encoders will be executed:

JavaScript string: characters like \ and " will be escaped like \\ \" so that the attacker cannot “get out” of the JavaScript string context.
HTML: characters like < will be encoded to < so that if the attacker is called </script> it will not break the “script” context and won’t be able to inject HTML in the sources of the page.

If the first escaping wouldn’t have been performed a string like "; evilCode(); var b = " would have executed evilCode.

This might seem trivial but there are some complicated cases where escaping might not be intuitive, like inside style blocks or HTML attributes (where escaping depends on the attribute key). Currently the standard library supports 16 different escaping functions and it goes through 24 different possible valid states and contexts while parsing the templates. It is very well designed and has only very minor known issues. I advise against trying to do this by hand or re-implementing this logic yourself. For Go 1.13 as long as you don’t interpolate data in a html tag name (and why would you?) or a JavaScript template literal (you shouldn’t use two templates together anyways) your app should be in a pretty good shape.

Moreover all widespread open-source libraries that I found in the wild for Go and Rust do not perform contextual auto-escaping so keep in mind that if you want to use a template that is not html/template you are putting yourself at risk. On this topic I can only suggest to stick to the standard library to stay safe.

Note: in the near future Google is planning to open-source another Go library that grants an even better level of protection against XSS. Stay tuned to be updated when it is released!

If you want to know more about some bypasses and advanced details about contextual auto-escaping here is your poison.

A final but very important note: if your response is not HTML (e.g. JSON or plain text) it is of vital importance that you appropriately set the Content-Type header to the correct value.

For example, in Go:

func jsonHandler(w http.ResponseWriter, r *http.Request) {
  // This should be set on the entire service
  w.Header().Set("X-Content-Type-Options", "nosniff")
  // This must be set on every JSON response
  w.Header().Set("Content-Type","application/json; charset=utf-8")
  // Write your JSON response here.
}

If you don’t do this there are some attacks that can be performed to trick the browser into executing code from non-HTML responses.

Client-Side (DOM based) XSS

If you think using the proper template will save you from XSS, you’re out of luck. There is still a whole family of XSS that is waiting behind the corner to hit you and your users when you least expect it. The server might do all the escaping correctly depending on the context the strings are interpolated in but, then, the client-side code might just decide to execute arbitrary code.

The Document Object Model(DOM) is a way to programmatically access and modify a webpage. JavaScript can call special functions and methods that can, during or after page load, change something.

Let’s look at an example:

<script>
var mydiv = document.getElementById("myDiv");
mydiv.innerHTML = decodeURI(window.location.hash);
</script>

If the current page URI ends with #<img%20src=x%20onerror="alert(1)"/> this code will execute alert(1) in the context of the page. The attack scenario in this case is that the attacker sends to the victim a link like:

https://vulnerable.com/#<img%20src=x%20onerror="evilCode()"/>

and if the user clicks on it the attacker executes evilCode in their browser, in the context of the vulnerable application.

As you can see no server-side code was involved. This is true to the point where the “frame” part of a URI (the string after and including the first # character) is not even sent to the server.

As a developer you might want to protect against this kind of vulnerability and you might expect the web platform to provide standard APIs for escaping and some well-defined ways to disable all JavaScript APIs calls that might result in an XSS.

Well… It doesn’t ¯\_(ツ)_/¯.

That said something can still be done: proper escaping can be performed with libraries like DOMPurify and a way to enforce a secure access to the DOM APIs like innerHTML is being worked on with trusted types. If you want to experiment with something new there is a a polyfill library ready for you to use. Moreover, if you use a major framework like Angular you should probably read their considerations on XSS. Please note that React does not perform URI sanitization so you should be very careful when interpolating URIs (like removing anything that starts with case-invariant javascript: schemes).

Mitigations

There are some features in the web platform that were built to help out. In the sad case in which some attacker-controlled code might end up in your HTML page there are some countermeasures that might still render the exploitation extremely hard or, sometimes, impossible. Those countermeasures help in both server-side XSS and DOM XSS, but they might have limited benefits in the latter.

Mitigation: Content Security Policy

Content Security Policy (CSP) allows developers to declare which scripts they trust so that the browser will refuse to execute any other code. Let’s see an example of HTTP HTML response protected with strict CSP:

HTTP/1.1 200 OK
Content-Type: text/html;
Content-Security-Policy: script-src 'nonce-thisIsSecureRandom'
Content-Length: 156

<html>
<body>
<script nonce="thisIsSecureRandom"> 
// This gets executed.
</script>
<script>
// This does not get executed.
</script>
</body>
</html>

The idea is to use an HTTP header to communicate to the browser a secret that changes for every HTML response generated by the server. Only scripts that have the nonce attribute set to that secret will be executed. This means that the attacker will need to guess the random or somehow predict it in order to get code to execute. This makes exploitation significantly harder and sometimes impossible.

If you want to (and you should) adopt CSP on your application you can take a look at Google official guide for it which contains an actual policy Google uses in production (the one above is oversimplified and might cause problems with some browsers).

Note: There currently is no library in Go to automatically add nonces and CSP but I’m working on it and will update you when it is ready.

In case your application doesn’t use JavaScript and you want to make sure it never will, not even by accident, you can set content security policy to the one described in the guide above and replace the script-src directive to be 'none'. This renders XSS impossible to exploit.

Content-Security-Policy: base-uri 'none'; object-src 'none'; script-src 'none';

Note: want to know more about CSP, the successes, the difficulties and the way to deploy and harden it at scale? We got you covered: a couple of engineers in my team gave a talk at LocoMocoSec this year on how to do it at Google scale!

Mitigation: Cookies scope

Cookies should be scoped and restricted to only be sent when necessary. In addition to that cookies should not be directly accessible by JavaScript. This can be achieved with the Domain and Path and HttpOnly attributes.

This mitigation provides little protection against XSS but it is very simple to deploy so it is still worth the effort.

Recap

If you want to secure you application from XSS you should:

Use templating engines with contextual auto-escaping (Go standard library or Google closure templates do it);
Appropriately set Content-Type (both response type and charset) and make sure browsers don’t “sniff” the Content-Type;
Adopt strict CSP;
Make sure to carefully review all uses of dangerous DOM APIs, use your frameworks properly and, potentially, try out trusted types;
Set your cookies as HttpOnly and scope them so that only the endpoints that need them receive them.

Let’s apply this to Go!

Trusted Types and JavaScript frameworks are client side, so not much for Go to do, but we can take a look at the rest.
To generate responses make sure you always write to an http.ResponseWriter set to Content-Type: text/html with an HTML template and nothing else. It should be easy to catch this mistake in code review but it is also quite easy to write an analyzer for it. If you want to try and write such a tool here is the doc and a talk about it. If you end up writing it, please share it!
Strict CSP and contextual auto-escaping can be done with something like the following proof of concept. Make sure you import "html/template" and "crypto/rand" and not "text/template" and "math/rand".

The code:

package main

import (
	"context"
	"crypto/rand"
	"encoding/base64"
	"fmt"
	"html/template"
	"log"
	"net/http"
)

const (
	cspKey = "csp-nonce"
	cspTpl = "object-src 'none'; script-src 'nonce-%s'; base-uri 'self'"
)

func protect(h http.Handler) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		w.Header().Set("X-Content-Type-Options", "nosniff")
		nonce := genNonce()
		w.Header().Set("Content-Security-Policy",
			fmt.Sprintf(cspTpl, nonce))
		ctx := context.WithValue(r.Context(), cspKey, nonce)
		h.ServeHTTP(w, r.WithContext(ctx))
	})
}

var tpl = template.Must(template.New("hello").Parse(`
<script nonce="{{.CSPNonce}}"> alert("This works!"); </script>
<script> alert("This doesnt!"); </script>`))

func main() {
	http.Handle("/", protect(
		http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
			w.Header().Set("Content-Type", "text/html; charset=utf-8")
			tpl.Execute(w, map[string]interface{}{
				"CSPNonce": r.Context().Value(cspKey),
			})
		})))
	log.Fatal(http.ListenAndServe(":8080", nil))
}

func genNonce() string {
	var b [20]byte
	if _, err := rand.Read(b[:]); err != nil {
		panic(err)
	}
	return base64.StdEncoding.EncodeToString(b[:])
}

Please note that here I’m decorating only a single handler but an entire http.ServeMux can be protected with CSP in the same way. Setting CSP on non-html responses will not affect your application.

Cookies attributes can be set together with cookies by using the standard http package.

This is all for today, stay tuned for the next chapter: Cross Site Request Forgery(CSRF).

Rob'n Go security pearls: Cross Site Scripting (XSS)