Know how the top DeFi protocol works under the hood
Welcome to the in-depth DeFi series! This series will be composed of several posts; each one will consist of an explanation of an individual protocol. The main purpose of this series is to understand from a first-principles approach how the top DeFi protocols work under the hood.
We will review them from a theoretical and practical level. This means that we will understand the theory behind the protocol, but we are also going to dive deep into the actual implementation (code).
Prerequisites
In order to follow along, it is useful to have the following knowledge:
- General understanding of the EVM
- Comfortable with Solidity
- Basic math (Algebra)
- General understanding of the fundamentals of blockchains and smart contract protocols built on top
The post is divided into two main parts: The first one, “Systems Components Overview,” explains the theory and practice behind the most important components that make the system work.
The second part, “System’s CORE Implementation,” goes through the actual architectural implementation.
I believe that to truly understand a system in depth, you should be able to rewrite it from scratch. Hopefully, by the end of this post, you will be able to do that.
In order to start learning the system in a structured manner, it is important to ask ourselves some questions that are fundamental to the system:
- How does Uniswap’s pricing mechanism work?
- Who provides liquidity to the system?
- Why would someone provide liquidity to the system?
- How does the system represent liquidity?
Along the way, all the fundamental questions should be answered by understanding the protocol deeply.
Before starting, let’s quickly talk about what Uniswap is.
Uniswap is a decentralized automated market maker protocol that allows anyone to swap token A for token B. Automated market makers work differently from a traditional order book model. The core principle that differentiates an automated market maker (Uniswap) from a traditional centralized order book is that the former runs permissionless code on the blockchain allowing anyone to participate.
Before moving on, I want to say a couple of things.
- Uniswap V2 is decentralized: Decentralization is not binary, but from my point of view (and many others), a decentralized smart contract protocol is one in which no central party can access the user’s funds. In other words, no one has a special key that can modify the contracts.
- There are three versions of Uniswap: For this post, we will focus on V2 (we will do a V3 post in the future). But if you want to learn how Uniswap was created, I suggest watching this video.
- A lot of AMMs are a fork of Uniswap V2: Uniswap V2 is much simpler than V3, therefore, easier to maintain. So once you know how Uniswap V2 works, you will know most of the AMMs as well!
Let’s start!
To start getting a feeling of how the protocol works, we need to ask ourselves what is really happening when we are doing a simple swap. Of course, the purpose of the entire post is to explain this.
But at a basic level, you are interacting with a smart contract (pool) that “holds” reserves of two different tokens. For example, if you are swapping some WETH for USDC, you are interacting with the WETH/USDC smart contract pair (we will detail how this works later on). If you are interested, here is the WETH/USDC smart contract pair on Etherscan.
So, if you are buying USDC with WETH, you are increasing the supply of WETH in the pool and reducing the supply of USDC, therefore, increasing the relative price of USDC vs WETH.
In the following sections, we will discuss how the pool works (the math behind it).
One important thing to keep in mind is that UniswapV1’s pools were always traded against Eth. In other words, every pool was ETH/xToken.
In Uniswap V2, every pool is ERC-20/ERC-20. This approach provides more flexibility to liquidity providers as they don’t need to rely 100% on ETH. So, every time you swap ETH for another token, your ETH gets converted to WETH first.
Uniswap works by providing incentives to different participants to work with the system in order for them to profit. The main participants of the system are:
- Traders: Traders can perform several actions in the system; some of them are:
- Speculating on the price of an asset.
- Arbitrage software. For example, if the price of ETH in Uniswap is trading at $3,000 and in Coinbase is trading at $3,800, the arbitrageur would immediately buy ETH in Uniswap and resell it in Coinbase for a profit.
- Basic swap functionality. You want to change token
x
to tokeny
to use them in a Dapp.
2. Liquidity Providers (LPs): LPs provide liquidity to the token pools. As rewards, they get the transaction fees.
3. Developers: Developers create the systems and applications that exist in the ecosystem. In Uniswap terms, they can be core protocol developers, third-party Dapp developers that integrate with Uniswap, wallet developers, etc.
Before diving into smart contracts, let’s understand the core concepts.
There are three basic components that we need to understand in order to have a deep knowledge of Uniswap (and most of the DeFi protocols): They are constant product formula, arbitrage, and impermanent loss.
Fees
UniswapV2 charges a flat 0.3% fee per trade. We will see how this is calculated in future sections. The fee goes to the liquidity providers, and this is to reward people for their liquidity. The protocol can also trigger a change that would give 0.05% to the Uniswap team, and that part of the fee would be discounted from the LPs instead of the traders.
A lot of the automated market makers work thanks to the constant product formula. It is a very simple yet powerful algorithm.
The constant product formula is the automated market algorithm that powers the Uniswap protocol (and a lot more AMMs).
This formula simply states that the invariant k
must remain unchanged regarding the outflow/inflow of x
and y
.
In other words, you can change the value of x
and y
to whatever you want, as long as k
remains the same.
For example:
What are x
and y
?
x
and y
are the reserves of the tokens in the pool. For example, if you are swapping DAI for WETH, you are interacting with the DAI/WETH smart contract pool. The total amount of DAI that the contract holds would be x
, and the total amount of WETH would be y
.
Simple enough. This is the algorithm that powers a lot of AMMs.
In the actual implementation, the formula is a bit different.
Let’s see the real formula that Uniswap V2 uses.
Here, you can find the full “swap” function implementation. But we are particularly interested in the following lines of code:
This piece of code is the real implementation of the constant product formula. Everything else you have read or seen is just the theory. These lines of code are the bare metal implementation of the constant product formula.
There are four variables inside the require
statement:
balance0Adjusted
: Reserves of x after the trader sends tokensX to the pool minus 0.3% of the amount sent.balance1Adjusted
: Reserves of y after the tokensY are sent to the trader from the pool._reserve0
: Reserves of tokenx
prior to the swap._reserve1
: Reserves of tokeny
prior to the swap.
Of course, balance0Adjusted
and balance1Adjusted
can be treated inversely.
Let’s do an example to understand this better:
- There is a DAI/WETH pool in Uniswap.
- Current reserves are 20000DAI / 10 WETH, and a trader wants to exchange 1 WETH for 1500 DAI:
As you can see, the trader sent 1 WETH to the pool in exchange for 1500 DAI. They could have gotten more DAI for their WETH, but the point here is to illustrate two things:
- The
k
never remains constant. It always increases due to the transaction fees, but it can also increase if people use the system inefficiently (like in our example, the trader could have gotten more DAI for their WETH). - The only check that the swap function really enforces is that the new reserves (minus fees) are greater than or equal to previous reserves:
newK ≥ oldK.
Here is actual code implementation of the previous example:
As you can see, things are multiplied by 1000. This is because there is no floating point numbers in Solidity, so this is necessary to represent the 0.3% fee. That is why the amount0In or amount1in are multiplied by 3 (to represent the 0.3%).
Getting the maximum amount out
A very important thing to understand is the maximum amount of tokens that you can get for a given input.
NOTE: In Uniswap V2 Periphery, you have all the helper functions to implement this math.
Example:
We have a DAI /WETH pool with the following reserves:
– DAI: 100,000
– WETH: 20
We want to swap 1 WETH for the maximum amount of DAI.
The maximum amount of DAI that we would get for 1 WETH (for this pool) is 4748.29.
The math we just did can also be represented in code (this function is from V2 Periphery):
It is very important to always get this math right in order to avoid getting fewer output tokens.
As a final note on this topic, you may be wondering what determines the variable k
?
K
is determined when the pool is first created, and then it grows every time there is a new trade. If we go to our past example, if a trader executes a swap just after the one we did, the k
for their trade is going to be the newK
of the prior trade.
Arbitrage is one of the most important concepts to understand how Uniswap works. Although this concept is not unique to Uniswap, it applies to almost all of the DeFi projects. In order to understand this concept better, the first thing we need to ask ourselves is, how does Uniswap know the price of a given token?
For example, when you are swapping WETH for DAI in Uniswap, how does the protocol know that the price of WETH relative to DAI is x
?
The short answer is that Uniswap has no clue what the real price is. In reality, the price matches the outside world due to incentives.
The only thing that the protocol really enforces in order to make a successful swap is the “constant product formula.”
Let’s go through an example.
Suppose there is a DAI/WETH pool in Uniswap. The price of WETH in Uniswap is trading at around 3000 DAI. We can get the price by dividing the number of reserves of DAI by the number of reserves of ETH (this is not the precise amount that you would get, we calculated that in the previous section).
Now, here comes the opportunity!
The price of WETH is trading at $3,200 on Coinbase. There is a difference of $200 USD between the WETH price of Uniswap and Coinbase. This will create an arbitrage opportunity, and immediately an arbitrageur (bot software) will buy WETH in Uniswap and sell it in Coinbase until the price matches.
This is how the prices in Uniswap are correlated with the outside world by arbitrageurs that are constantly searching for an arbitrage opportunity. There are different types of arbitrage (we will not go into detail). But for the example shown, it is not an atomic arbitrage.
In other words, the risks are higher because there is a probability that when the arbitrageur tries to earn a profit in Coinbase, the prices are already at par (because another arbitrageur did it first). That is why the safest arbitrage is done on-chain because transactions are atomic, meaning, if something fails, the whole transaction reverts (minus gas fees).
So, in conclusion, Uniswap does not know about the outside world prices. Thanks to the arbitrageurs, the prices are almost identical to the outside world.
Impermanent loss for liquidity providers is the change in dollar terms of their total stake in a given pool versus just holding the assets.
Let’s create an example to understand this better.
Imagine that Alice has some tokens in her wallet, so she wants to become a liquidity provider to earn some yield. For our example, she is going to create a new DAI/WETH pool in Uniswap. The current market price of WETH is $3,000, so to avoid arbitrage, she will need to initiate the pool by putting 3000 DAI for every WETH.
She will initiate the pool by putting 10 WETH and 30,000 DAI in, with a total USD value of $60,000 (we need to keep this number in mind).
*To make it simple, we will not take the transaction fees into consideration.
Great, so now Alice has become an LP of this DAI/WETH Uniswap pool. Now, let’s imagine that the price of WETH goes up to $4,687. As you should know by now, this will create a huge arbitrage opportunity, so immediately, an arbitrageur will buy ETH in Alice’s pool until the price is at par with the outside market.
So, after some arbitrage, Alice’s pool will look something like this (again, we are not considering the transaction fees):
As we can see, after some arbitrage, now the pool has 8 WETH instead of 10 and 37,500 DAI instead of 30,000 (what Alice initially supplied). The total amount of the pool is now $75,000 ((8ETH * $4,687.5) + 37,500)).
Here comes the problem: If Alice had just held the assets, she would have had more money. Remember that initially, she supplied the pool with 10 WETH and 30,000 DAI. So, in the end, she had an impermanent loss of $1,875:
For this example, we did not contemplate the profit from transaction fees (0.3%), but the concept remains unchanged.
It is called “impermanent” because the loss only applies if the LPs sell at the present time. If the assets fluctuate, then that loss can be mitigated.
Hopefully, the concept is clear by now! If still in doubt, I highly recommend watching this video.
Big amounts of liquidity are what make Uniswap an attractive system to use. Without enough liquidity, the system becomes inefficient, particularly for big trades relative to the total liquidity of the pool.
Note: For big trades (relative to the pool), it is better to use an aggregator to decrease slippage as much as possible.
Ok! So the first thing we need to understand is, how are LP tokens minted and burned?
When a liquidity provider provides liquidity to a pool, it receives its LP tokens proportionally to its amount of liquidity.
Let’s unpack this.
A Uniswap pool is just a smart contract that “holds” a certain amount of reserves (token x and y). The liquidity providers provide these reserves. But, this pool also has an in-house token called “LP token.” This token is unique to each pool, and the main purpose is to keep track of the liquidity each liquidity provider has injected. You can think of this token as a certificate of your liquidity. This token also accrues the 0.3% fee that Uniswap charges.
The smart contract pool imports “UniswapV2ERC20.sol” (you can find it here), which is basically a contract that has basic ERC20 functionality (minting, burning, transfer, etc..).
The LP tokens are “stored” inside of this UniswapV2ERC20.sol. Every time a liquidity provider provides liquidity, some amount of tokens are minted to his address. Inversely, every time the liquidity provider takes liquidity out, the LP tokens are burned.
So, how does this work?
If we think about it from a high level, a liquidity provider can a) provide liquidity and b) take out the liquidity. Uniswap implements this mechanism in two functions: mint()
and burn()
.
Let’s first go with the mint function. You can find the full function implementation here.
In order to calculate the LP tokens that a liquidity provider receives, we first need to know if there is liquidity or if it is the first time someone is providing liquidity.
Here is the piece of code (inside of the mint function) that we are interested in:
The MINIMUM_LIQUIDITY is a constant variable equal to 1000:
To understand this better, we are going to go through an example.
Note: In Ethereum, the currency for computation is WEI (ETH * 10 **18). This also applies to most of the ERC-20 tokens. For simplicity, we will write the numbers without the 18 zeroes, but every time there is a hardcoded number like MAXIMUM_LIQUIDITY = 1000, we will just accommodate that number instead.
Let’s suppose that Alice (a liquidity provider) just created a new DAI/WETH pool.
She will supply 10,000 DAI and 2 WETH:
Before Alice, the total supply of the pool was 0, so the first thing that is going to happen is mint 1000 tokens to the address 0 (this is done to maintain a minimum liquidity):
Note: Every time there is a mint, the totalSupply variable gets updated:
We then get the amount of LP tokens that are going to be minted to the liquidity provider (Alice). For our example, the result was 141.42.
Again, this is the formula to determine that:
Great, so now our pool consists of the following:
Total supply means the total amount of LP tokens there are in circulation (do not confuse it with the x and y reserves).
Now, let’s imagine that Alice wants to withdraw every DAI and WETH she put into the pool. For the sake of simplicity, let’s suppose that the reserves remain the same.
In order for Alice to remove her liquidity, she needs to call the burn function, but prior to that, she needs to transfer all her LP tokens to the pool’s address. Let’s understand how this works. Here is the complete burn function implementation:
If we see line 187, the variable liquidity is the amount of LP tokens that the contract’s address hold. Hence, that is why I mentioned that Alice needs to transfer the LP tokens prior to the function call. Then, in lines 191 and 192 are the amounts of tokens x
and y
to be transferred to Alice.
Following our example, let’s unpack this:
As you can see, amount0 = 10000 and amount1 = 2. That is the initial amount that Alice supplied!
After the amounts are calculated, they are transferred and then burned.
This is the magic behind LP tokens!
As a final note on this topic, if we remember, the protocol charges a 0.3% trading fee. This fee goes to the liquidity providers. The way the fee is technically accrued is by increasing the reserves of x
and y
in every trade. This will positively impact the amount of LP tokens each liquidity provider holds (proportionally to their holdings).
If we go back to the burn function, these two lines of code are what calculates the number of tokens x
and y
to send to the liquidity provider when they want to exchange their LP tokens for the x
and y
tokens.
Every time a trader executes a swap, it needs to send a buffer of 0.3% relative to the size of the trade. That 0.3% increments the reserves of the pool. Therefore, increasing the value of balance0
and balance1
in the burn function.
Now that we have a solid understanding of the individual components let’s go to the actual implementation.
This part will be much faster, as the core topics were already covered.
Protocol Architecture
Uniswap V2 is a binary smart contract system. It is composed of the V2-Core and V2-Periphery.
- V2-Core are the low-level base contracts. These smart contracts are responsible for the system’s functionality.
- V2-Periphery are helper contracts that allow frontend applications and developers to integrate with the core contracts by applying safety checks and abstracting away certain things.
In simpler terms, V2-Core is the part of the protocol that implements the core features (swapping, minting, burning, etc.). In contrast, V2-Periphery is one layer up. It is a set of contracts and libraries that make the integration easier for the developers.
The Uniswap team also architected this modular codebase approach to reduce security-critical vulnerabilities by having the bare minimum in V2-Core. In other words, they only implemented what is necessary for V2-Core, so there is less code to audit (therefore more secure). All the helper functions were thrown to V2-Periphery.
We will only focus on V2-Core. The simple reason is that if you know how V2-Core works, you also know how V2-Periphery works (but not the other way around).
Again, you can find the V2-Core repo here.
There are three main contracts:
UniswapV2ERC20.sol
: This contract is responsible for handling the LP tokens. It is a basic ERC20 contract, so we will not go over this.UniswapV2Factory.sol
: This is the factory that is responsible for deploying new pools (pairs) and also to keep track of them.UniswapV2Pair.sol
: This is where the action happens (the pool implementation).
Uniswap V2 Factory
Again, the factory contract is mainly responsible for creating new contract pairs (UniswapV2Pair.sol). Here is the V2 factory contract on Etherscan.
In order to concentrate liquidity, there can only be one smart contract per pair. In other words, if there is a WETH/UNI pair contract already, the factory won’t allow you to create the same pair. Of course, you can bypass that (by deploying the pair contract directly), but the core principle here is to concentrate liquidity as much as possible to avoid price slippage and have more liquidity.
Here is the function that creates pairs in UniswapV2Factory:
One thing that’s probably still in question is, what determines the order of x
and y
?
The pairs are grouped by hexadecimal order, as you can see in line 25:
Line 27 checks that the pair is unique:
Lines 30–32 create the pair (UniswapV2Pair.sol) using create2:
Then it initializes the contract with the address of each token and adds them to an array and mapping.
Another important point is that the factory can turn on a fee to charge a percentage per swap (you can check that yourself; it is pretty trivial).
That’s it for the factory. It is a very simple and straightforward factory implementation, nothing complicated.
Uniswap V2 Pair
UniswapV2Pair.sol
is the foundation of UniswapV2. Here is the contract.
This contract is responsible for handling unique pools (each pool = one contract).
The basic functionality of this contract is to swap
, mint
, and burn
. We already went into detail as to how these components work, so we will just give them one quick brush.
Swap()
: The swap function is the star of Uniswap. Every time you want to trade one token for another, the function gets called. You can see the full function implementation here. The basic task of this function is to enforce that newK
is greater than or equal to k
:
Mint()
: The mint function is responsible for minting LP tokens every time a liquidity provider provides liquidity. You can find it here.
Burn()
: The burn function is responsible for burning LP tokens every time a liquidity provider wants to take his assets out of the pool. You can find it here.
You made it!
You just learned how most automated market makers’ engine works!
Note on Flash Loans
Flash loans are a very important topic that we did not cover in this post. The reason why is that we are going to cover it in depth in the next post about AAVE.
Hopefully, you enjoyed it!
References: