(Any images seen here are attributed to the presentation in the video series mentioned above. This blog post is more like watching those videos on fast-forward, and I would definitely encourage you to check out the videos if you would like more clarity.)
Mathematics provides infinitely many ways of expressing the same thing.
Take the number 12. There are infinitely many ways to express it:
expr 1) 6 + 6
expr 2) 3 * 4
expr 3) 141 - 129
expr 4) 4353475 - (4353462 + 1)
All the above expressions evaluate to 12. When the options are infinite, how do you choose how to express something?
The same question applies to writing formal specifications and programming.
The advice from the book is simple:
…, then you can choose the one that you feel makes the specification easiest to understand.
Yes, choose the one that will be the easiest to READ at a future point in time.
Example: someone might find it okay to choose (expr 3) above to express 12, the reasoning being, “come on, it is not that complex!” - especially not as complex as (expr 4). But when others (or the same person) read it at a future point in time, they might wonder, “why didn’t we choose (expr 1) or (expr 2)?”.
I have seen the equivalent of this happening in programming.
My advice for anyone (especially if you are getting started with programming and are in that phase where you get excited about different programming languages and their features) would be:
If there are two ways to express something, choose the one that will be the easiest for a human - not the compiler - to read and understand at a future point in time.
~ ~ ~ ~
Optimize for reads, when writing.
With those good enough reasons, I stumbled upon this awesome GitHub repo which curates various testing strategies for distributed systems. One of the things that stood out for me in that list was “formal methods”, more specifically “TLA+”. It then led me to watch this awesome conference video comparing TLA+ and Jepsen/Maelstrom - the video made me feel excited about both technologies. A quick lesson from the video: TLA+ is apples and Jepsen is oranges - we would ideally want to eat both.
I then decided to learn more about TLA+ since it comes in the earlier stages of the design process. I had previously attempted to learn TLA+ but didn’t succeed - mainly due to a lack of motivation in the middle of the learning process. So, this time, I wanted to be motivated enough before attempting to learn it again and to try using it in my side project or at work. This line of thinking reminded me that AWS had published a paper about TLA+, which I had heard of in the past. So I decided to pick it up and read it.
You can get a copy of it from here.
This paper is an experience report from the engineers who spearheaded the movement of using formal methods to verify the complex distributed systems being built at AWS, such as S3, DynamoDB, etc. At first, they didn’t think of formal methods and were investing in other types of testing. Those tests helped, but there were still edge cases that could cause serious bugs.
They open with the scale that they are dealing with:
As an example of this growth; in 2006 we launched S3, our Simple Storage Service. In the 6 years after launch, S3 grew to store 1 trillion objects [1]. Less than a year later it had grown to 2 trillion objects, and was regularly handling 1.1 million requests per second [2].
Imagine that you were about to design a system for such a high scale and growth - how will you gain confidence about its design and correctness? If you are making any changes to the system at some point, how will you be confident about the effects of your changes?
The first line of defense for gaining that confidence is using formal methods to specify and check your system design. Once we have made sure that the design is correct, we start to implement it and write “tests” which check the correctness of the code (this is the classic software testing that we are used to).
What do most of us do most of the time while designing systems?
… conventional design documents consist of prose, static diagrams, and perhaps pseudo-code in an ad hoc untestable language. Such descriptions are far from precise; they are often ambiguous, or omit critical aspects such as partial failure or the granularity of concurrency (i.e. which constructs are assumed to be atomic).
I have noticed this divergence between reality and the design docs/diagrams in day-to-day engineering. What if, during the process of creating those beautiful diagrams and design docs, we wrote something more detailed - something that helps us down the line when we are trying to alter the system? That something turned out to be TLA+ for AWS.
TLA+ is based on simple discrete math, i.e. basic set theory and predicates, with which all engineers are familiar. A TLA+ specification describes the set of all possible legal behaviors (execution traces) of a system.
TLA+ is intended to make it as easy as possible to show that a system design correctly implements the desired correctness properties, either via conventional mathematical reasoning, or more easily and quickly by using tools such as the TLC model checker [5], a tool which takes a TLA+ specification and exhaustively checks the desired correctness properties across all of the possible execution traces.
TLA+ is accompanied by a second language called PlusCal which is closer to a C-style programming language, but much more expressive as it uses TLA+ for expressions and values. In fact, PlusCal is intended to be a direct replacement for pseudo-code.
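In case you are curious what that looks like, here is a tiny toy PlusCal spec of my own (an hour clock - not an example from the paper). The algorithm lives inside a comment block, and the PlusCal translator turns it into a plain TLA+ specification that TLC can check:

```
---- MODULE HourClock ----
EXTENDS Naturals

(* --algorithm HourClock
variable hr = 1;
begin
  Tick:
    while TRUE do
      hr := (hr % 12) + 1;
    end while;
end algorithm; *)
====
```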
In industry, formal methods have a reputation of requiring a huge amount of training and effort to verify a tiny piece of relatively straightforward code, so the return on investment is only justified in safety-critical domains such as medical systems and avionics. Our experience with TLA+ has shown that perception to be quite wrong.
Excellent, that is exactly what I needed to hear. They also provide this nice table of real-world results:
TLA+ has been helping us shift to a better way of designing systems. Engineers naturally focus on designing the ‘happy case’ for a system
and
Once the design for the happy case is done, the engineer then tries to think of “what might go wrong?”, based on personal experience and that of colleagues and reviewers.
…. Almost always, the engineer stops well short of handling ‘extremely rare’ combinations of events, as there are too many such scenarios to imagine.
and
In contrast, when using formal specification we begin by precisely stating “what needs to go right?”
….
- Safety properties: “what the system is allowed to do”
- Liveness properties: “what the system must eventually do”
After we define those properties, we need to check whether they hold true under the various kinds of things that can happen in the system.
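To make the two kinds of properties concrete, here is a toy TLA+ sketch of my own (not from the paper) - a counter that ticks from 0 to 3, with one safety property and one liveness property:

```
---- MODULE Counter ----
EXTENDS Naturals
VARIABLE x

Init == x = 0
Next == x < 3 /\ x' = x + 1
Spec == Init /\ [][Next]_x /\ WF_x(Next)

\* Safety - "what the system is allowed to do":
\* x never leaves its allowed range.
TypeOK == x \in 0..3

\* Liveness - "what the system must eventually do":
\* x must eventually reach 3.
EventuallyDone == <>(x = 3)
====
```

TLC can check `TypeOK` as an invariant and `EventuallyDone` as a temporal property across all behaviors of `Spec`.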
Next, with the goal of confirming that our design correctly handles all of the dynamic events in the environment, we specify the effects of each of those possible events; e.g. network errors and repairs, disk errors, process crashes and restarts, data center failures and repairs, and actions by human operators.
So there should be a way to model these events in the system too. (The video that I mentioned at the top helped me digest this portion of the paper more easily)
We have found this rigorous “what needs to go right?” approach to be significantly less error prone than the ad hoc “what might go wrong?” approach.
In several cases we have prevented subtle, serious bugs from reaching production. In other cases we have been able to make innovative performance optimizations – e.g. removing or narrowing locks, or weakening constraints on message ordering – which we would not have dared to do without having model checked those changes.
Awesome!
They are interested in two things:
1) bugs and operator errors that cause a departure from the logical intent of the system, and
2) surprising ‘sustained emergent performance degradation’ of complex systems that inevitably contain feedback loops.
(1) is achievable via formal methods but not (2). They give a good example of what (2) would look like and mention that they have other ways to mitigate it.
This and the upcoming sections of the paper are well narrated and I felt like I was watching a documentary movie while reading these sections.
Another option they were considering was Alloy, as they found evidence of its usage:
Zave used a language called Alloy to find serious bugs in the membership protocol of a distributed system called Chord. Chord was designed by a strong group at MIT and is certainly successful; it won a ’10-year test of time’ award at SIGCOMM 2011
But they chose TLA+ over Alloy as it was not as expressive as they needed it to be.
Eventually C.N. stumbled across a language with those properties when he found a TLA+ specification in the appendix of a paper on a canonical algorithm in our problem domain: the Paxos consensus algorithm
The fact that TLA+ was created by the designer of such a widely used algorithm gave us some confidence that TLA+ worked for real-world systems.
Yeah, TLA+ was invented by Leslie Lamport, who has given us some of the coolest research - research that gets used in a lot of the systems we rely on.
T.R. says that, had he known about TLA+ before starting work on DynamoDB, he would have used it from the start. He believes that the investment he made in writing and checking the formal TLA+ specifications was both more reliable, and also less time consuming than the work he put into writing and checking his informal proofs.
Totally love this section. I would use the techniques mentioned here if I were to introduce formal methods and verification to other engineers.
This raised a challenge; how to convey the purpose and benefits of formal methods to an audience of software engineers? Engineers think in terms of debugging rather than ‘verification’, so we called the presentation “Debugging Designs”
and
Continuing that metaphor, we have found that software engineers more readily grasp the concept and practical value of TLA+ if we dub it:
Exhaustively testable pseudo-code
Another thing I saw that I didn’t expect was:
Most recently we discovered that TLA+ is an excellent tool for data modeling, e.g. designing the schema for a relational or ‘No SQL’ database.
Wow, this helped them come up with a better schema!
“How do we know that the executable code correctly implements the verified design?”
We don’t, but
Formal methods help engineers to get the design right, which is a necessary first step toward getting the code right. If the design is broken then the code is almost certainly broken, as mistakes during coding are extremely unlikely to compensate for mistakes in design. Worse, engineers will probably be deceived into believing that the code is ‘correct’ because it appears to correctly implement the (broken) design. Engineers are unlikely to realize that the design is incorrect while they are focusing on coding.
Seems like they published a whole other paper on this topic.
When we found that TLA+ met those requirements, we stopped evaluating methods, as our goal was always practical engineering rather than an exhaustive survey.
I hope you enjoyed this post and got the urge to explore and learn TLA+ - I feel this has the power to change the way we think and reason about our systems. I hope to write up more when I try to use it in real-world situations.
From here, I would like to read this which was one of the references from that paper and try to learn and write TLA+ for something(s).
Formal methods deal with models of systems, not the systems themselves, so the adage applies:
“All models are wrong, some are useful.”
~ ~ ~
oh, and TLA is an acronym for Temporal Logic of Actions
I want to share a particular section which WOWed me. If you know Tamil and have an Amazon Prime Video account, search for “Alex in Wonderland” and go to the 58th minute of the show. For the rest of you, I have typed up the bits that I wanted to share:
Alex says:
Think about this simple instrument
You know the name of this instrument?
It is called the Double Bongos right?
One of the simplest percussion rhythm instruments.
And you can buy this for 700 rupees in Chennai even today.
And will you believe if I say this simple instrument ruled Tamil film music for half a century man?
I am not exaggerating.
For 50 years every other super hit song that came in Tamil film music had only this instrument as the core rhythm instrument.
This sound I’m sure you can all recall…….
(plays the double bongos)
This sound ruled Tamil film music for half a century.
This music director we all adore. He will live forever. He’s living forever.
He made amazing, wonderful, soulful songs.
The melodies will be out of this world.
But the percussion: just bongos and nothing else.
Of course I am talking about the King of Melodies, “M S Viswanathan” (fondly called MSV)
I think MSV wants to tell us one thing very clearly.
Beauty lies in simplicity.
Even on Bongos he wouldn’t complicate.
He would not go into the complex rhythm patterns and all.
Just the 4-beat rhythm for every song ya.
This four beat: one, two, three, four, that’s all.
Whatever may be the situation. Whatever may be the emotion that he has to show. Anything that ever happens in any story, in anybody’s life - MSV has captured anything and everything in this 4-beat rhythm.
one two three four.
(Alex sings some of the super-hit MSV songs by live-playing the four beats on double bongos)
Just WOW. These songs have been heard millions of times by a lot of people. I myself had heard them but never noticed this basic construct. That’s why I thanked Alex at the start of this post. We could not taste the essence of music without people like him.
If you are curious, here is a small playlist of MSV’s songs that Alex performs to demonstrate the double bongos.
Notice how thoughtfully the bongos come in at the start of each song. They continue in harmony throughout every song.
This got me thinking and inspired me. I think the lessons for me (and any of us reading this) are:
We are talking about a legend here. At the core of his compositions lies this touch of simplicity. How beautiful! A little simplicity has a lot of mileage (50 years). MSV retired and didn’t compose songs for movies for over 20 years. But as Alex bets, if he had composed during that time, the magic would still have worked!
I am already a fan of “Simplicity” - the reason I prefer using the Go programming language a lot :D Simplicity is not easy, but trying to get there is well worth it. (Obligatory link to the famous tech talk on this subject here.)
I searched Amazon for the double bongos and it still costs 700 Indian Rupees. That is equal to 8.49 USD.
Crazy, right? MSV was able to produce legendary music with it.
I always advise myself and others not to worry about lacking the money to afford something in order to make progress in an area.
Learning programming? You don’t need that latest expensive MacBook Pro or whatever. All you need is a Raspberry Pi running Linux.
~ ~ ~ ~
oh, and don’t forget the four beats of the double bongos.
one, two, three, four.
I used `context.WithValue` to do it. In retrospect, while reading the Go docs for it, I believe I have gone against every possible rule for using it 😅 Sometimes you have to try things out practically to get a lasting lesson.
This is such a case and I am going to share the lessons that I learned here.
All these lessons come from this single commit - feel free to take a look at it if you are interested.
I have three kinds of packages:

- the `main` package - the starting point of my app
- the `trigger`, `connector`, `scaler` packages - these are called from `main` and accept a context
- the `event` package - initialized in `main` and meant to be used by the packages above

```go
package main
```
Inside the scaler, I would do something like this:

```go
func (s *Scaler) Register(ctx context.Context) error {
	eventBus := ctx.Value("eventBus").(event.Bus)
	eventBus.Subscribe( /* ... */ )
	// ...
	return nil
}
```
This line in `main.go` is what is wrong:

```go
ctx = context.WithValue(ctx, "eventBus", eventBus)
```
While trying to refactor, I accidentally removed that line from `main.go` and ran `go build`. Guess what? The build succeeded without any problem 😱
This is scary because the `eventBus` is at the core of my project. All the packages emit and subscribe to events via it. I would have expected a compiler error when something as obvious as not passing it to these packages was happening.
If we run the passing build, it results in a runtime panic whenever we hit the code path where the value is used. Because we fetch `eventBus := ctx.Value("eventBus").(event.Bus)` at runtime and we missed setting that value via `context.WithValue`, we get back a nil reference. Since that value is used immediately in `eventBus.Subscribe()`, it leads to a runtime panic:
```
panic: interface conversion: interface {} is nil, not event.Bus
```
It is time to visit the Go docs for `context.WithValue`:
WithValue returns a copy of parent in which the value associated with key is val.
Yep, I did want a value associated with my key.
Use context Values only for request-scoped data that transits processes and APIs, not for passing optional parameters to functions.
LOL, I was not even trying to pass an optional parameter, but a mandatory parameter.
The provided key must be comparable and should not be of type string or any other built-in type to avoid collisions between packages using context.
LOL, I was using string type.
Users of WithValue should define their own types for keys.
I did have this idea in mind and wanted to do it as a refactor.
To avoid allocating when assigning to an interface{}, context keys often have concrete type struct{}. Alternatively, exported context key variables’ static type should be a pointer or interface.
Okay, I still don’t fully understand this part because the example in the Go docs seems to use a string type:

```go
type favContextKey string
```

I would have expected it to be something like this, based on that last line from the docs:

```go
type favContextKey struct{}
```
I am guessing `k1` and `k2` will result in memory allocations whereas `s1` and `s2` won’t. Could somebody confirm this for me?
As the docs suggest, it should strictly be used for carrying request-scoped data that ideally lives only during the lifetime of a request.
Example: let us consider an HTTP handler which gets called every time a client makes an HTTP request to us.
```go
func(w http.ResponseWriter, r *http.Request) {
	// ...
}
```
So, here the context is very specific to the handler and lives only throughout the handler’s lifetime. It is used to store a piece of information specific to the request (i.e. the request id) and to pass it to downstream API requests which could make use of it.
Two URLs on the internet helped me in my learning here:
~ ~ ~ ~
I dedicate this to all the people who are faced with the question “should I pass my logger down in my Go context?” in their busy lives. The answer is simple: don’t do it.