Published on February 29, 2024

SAML 2.0 Authentication

SAML stands for security assertion markup language. It is a widely used open standard for authentication and authourization between different parties. The advantages of adopting SAML/SSO include single source of identity and enable consistent authentication across services. Centralizing authentication simplifies granting and revoking access across different services. The parties involved in the SAML authentication flow include the principal, the service provider (SP), and the identity provider (IdP). The principal is the user trying to authenticate and tends to have metadata associated to them. Service providers are the services requesting authentication and identity information about the principal. Lastly, identity providers serve as the source of identity information and authentication decision. The relationship between the three can be thought of as IdP authenticates principals and returns identity information to SP. Before authentication can happen, both SP and IdP need to coordinate and exchange setup information. IdP will need to provide the SSO endpoint and public key to the SP for identity decryption upon successful authentication. In exchange, the SP will need to provide IdP an assertion consumer service (ACS) URL. The ACS URL is where the IdP will POST the SAML assertion to along with redirecting the principal to a defined target resource after authentication. The SAML authentication flow starts with the user (the principal) clicking on an "Continue with SSO" button on the SP's landing page. The SP will create and serialize an XML document known as the AuthnRequest, and appends to the IdP SSO endpoint as query parameter before redirecting the principal to the IdP SSO endpoint. The principal will then enter their credentials for authentication. Upon successful authentication, the IdP encrypts the principal's identity information with its private key. It then redirects the principal back to the ACS URL, passing along the encrypted identity information as part of a SAML payload. The SP will receive and decrypt the payload using the corresponding public key before processing the SAML by creating or configuring user session based on SAML. Sequence Diagram saml-flow-sequence-diagram

Authentication Identity Provider Principal SAML Service Provider SSO Tech User Authentication

Published on December 7, 2023

Redis Use Cases

Redis is a well-known in-memory key-value store typically used as a cache system. However, there are many other use cases for Redis. It is important to note that since Redis is an in-memory database, all data will be lost if the Redis server restarts or crashes. For this reason, Redis provides the option to persist data to disc. While Redis allows data persistence to disk, it’s not the most efficient solution for recovering from crashes. Maintaining separate replicas for promotion as the primary instance offers faster recovery. As a cache, Redis enables efficient retrieval of frequently accessed data. This will reduce the load on the database and improve the response time of the application. Redis is also used as a session store. Normally, session data is persisted with the instance by which the user logs in. This means that the user is logged in to that instance only. This is not stateless and makes horizontal scaling very difficult. Redis enables decoupling session data from each instance and removing the need for each machine to remember the session state information. A simple rate limiter can also be implemented using Redis. At a very high level, this is done by mapping user IP to a counter with an expiration policy. If the current count exceeds the allowed threshold, then the request is blocked until the current count falls below the allowed threshold again. Lastly, Redis can also serve as a distributed lock to protect mutable resources. Suppose there are two clients, A and B, who wish to modify some common resources at the same time, client B can lock the resource by setting a key in Redis. This prevents client A from accessing the said resource until client B releases the key by deleting it from Redis. These are a few examples of what Redis could be used for. Redis’s diverse capabilities and ease of use make it a valuable tool for a wide range of applications.

Cache Distributed Lock Rate Limiter Redis Session Store Tech

Published on December 3, 2023

Slice and Dice a Binary Tree

Hierarchical data are ubiquitous. This makes understanding the techniques on how to process them all the more important. The three standard algorithms are pre-order, in-order, and post-order traversals. This post will look at the mentioned techniques and a few others that will group hierarchical data by rows and columns. We will also explore the top, bottom, right, and left views of a tree. PreOrder, InOrder, and PostOrder Traversals Starting with the standard depth-first traversals, we have pre-order, in-order, and post-order as follows. In depth-first traversal, the algorithm is encouraged to go as far into one branch as possible before backtracking. GroupByDepth Next, we have level-order or group-by-depth. If we only care about printing the node values, then the traditional level-order traversal will do the job. To group the nodes by level, any of the standard depth-first traversals can be used in conjunction with a map of level to list of nodes. A level tracker is also used to track the depth as the algorithm proceeds each subtree. GroupByColumn Vertical-order, or group-by-column, follows a similar implementation to level-order except for one key difference. Instead of tracking depth during the recursive calls, we will add one if we go to the right subtree, and subtract one if we go to the left subtree. Top, Bottom, Left, and Right Views Lastly, getting the right side view of a tree can be done by using one of the depth-first traversals and leveraging the level data to either put the value into a map if it is the first time the level is visited or override the value if the level has been visited before. As the depth-first traversal moves from left to right, the value at the level is overridden until the right-most values remain. Getting the left side view can be accomplished by reversing the traversal order. Getting the top or bottom view relies on the vertical level rather than the depth info.

Binary Tree Binary Tree Traversal DS/A Tech Tree Algorithms

Published on November 30, 2023

Capacity Planning

Envelope calculation is a technique used to check and validate a design idea. The two numbers we want to arrive at are queries per second (QPS) and storage. It is important to note that absolute accuracy is not important and the goal with envelope calculation is to get within the order of magnitude. Queries per second (QPS) is the number of requests we can expect per second. This can help inform the number of servers needed to support the QPS. In general, QPS can be broken down into daily active users (DAU), DAU\_subset, content\_rate, peak\_scale\_factor, and seconds per day. DAU is usually easy to obtain using historical traffic data. DAU\_partition estimates the subset of DAU that this feature is designed for. For example, only 20% of Twitter’s DAU actually tweets. The content\_rate estimates the average amount of content produced or consumed per DAU. Using the given example, there may be 2 tweets per DAU. The peak\_scale\_factor is a multiplier that estimates what the QPS could be during peak traffic hours. Lastly, there are 86400 seconds in a day, but 1e5 can be used to make calculation easier. Putting it all together, the formula looks like: QPS = DAU \* DAU\_subset(%) \* content\_rate \* peak\_scale\_factor \* 1e5 Storage is another number to calculate, which will help inform the storage capacity need for the particular feature being designed. Storage can be broken down into DAU, DAU\_subset, content\_subset, content\_size, retention, copies, and days per year. DAU and DAU\_subset are the same as QPS. The content\_size estimates the average content size being stored. Related to content, it is also important to consider what portion of all content contains the content type of interest and partition appropriately. Retention and copies estimate how long the data will be kept and the number of copies that should be kept. Lastly, there are 365 days in a year, but we will round this to 4e2 for easy calculation. Putting it all together, the formula looks like: Storage = DAU \* DAU\_subset(%) \* content\_subset(%) \* content\_size \* retention(yr) \* copies \* 4e2 Having QPS and Storage will help guide the decisions during system design such as the number of servers, database shards, load balancing, and etc.

Daily Active User (DAU)Envelope Calculation Queries Per Second (QPS)Storage Capacity System Design Tech

Published on November 1, 2023

Improving API Performance

Suppose you noticed that the latency of your API service is slowly creeping up in line with the increase in traffic. You have added additional compute to the load balancing pool to help distribute the load, but it may be time to explore some optimization at the code level. This article will explore 5 of infinite techniques on how to increase the performance of an API service. They are caching, minimizing N + 1 queries, paginating large results, data compression, and avoiding synchronous logging. Caching Starting with caching, which usually lives between the middle tier and the database. The idea is to store the results of expensive computations to be reused at a later request. This can help reduce the number of database hits for frequently accessed endpoints with the same parameters. Minimize N+1 Queries Minimizing N + 1 queries against the database can significantly improve API performance. Often time this problem appears in hierarchical data where you might query for data at one level, then make another query for each of the results. For example, this could mean one query to get a list of posts, and then another query for each of the posts to retrieve a list of comments per post. Pagination Instead of returning the full dataset per query, consider paginating the results and returning a subset of the full dataset. This will improve query time on the data layer, processing time in the middle tier, and network load. Data Compression Data compression can help reduce the size of the response payload and the amount of data being transferred over the network. The client will need to decompress the response payload before using it. Similarly, this will help reduce network load. Avoid Synchronous Logging Lastly, avoid synchronous logging in favor of fire and forget. Synchronous logging will add to the round trip time of an API request. The time it takes to write one log entry is insignificant, it can add up in a high throughput system, especially if the request has multiple points of logging. These are five examples of how to improve an API’s performance. Keep in mind that premature optimization can lead to unnecessary complexity.

Cache Code Optimization Data Compression Latency Optimization N+1 Queries Pagination Tech

Published on October 24, 2023

Solvency Ratios

Solvency ratios are financial metrics that assess a company's ability to service its long-term debt and interest. Solvency ratios are also known as financial leverage ratios that compare the company's debt to its assets, equity, or earnings. In other words, how many times the company can cover its debt given its assets, equity, or earnings. Unlike liquidity ratios, solvency ratios have a longer term outlook. Solvency ratios vary from industry to industry, making them less effective when comparing companies between different industries. However, it can be used to highlight anomalies between peers within the same sector. The four common solvency ratios are Equity Ratio, Debt-to-Equity Ratio, Debt-to-Assets Ratio, and Interest Coverage Ratio. Equity Ratio Also known as Equity-to-Assets Ratio, shows how much a company is funded by equity as opposed to debt. A higher Equity Ratio indicates that a larger portion of the company's assets is funded by shareholders' equity rather than debt, which is favorable for the company's financial health. A lower Equity Ratio suggests that the company relies on debt to finance its business. The Equity Ratio formula is: Equity Ratio = Shareholders' Equity / Total Assets Debt-to-Equity Ratio Debt-to-Equity Ratio (D/E) indicates how much of a company's equity is financed by creditors. Debt-to-Equity Ratio also shows how much of the company's debt can be covered should it liquidates its equity. A lower Debt-to-Equity Ratio is favorable for the company's financial health. The formula for Debt-to-Equity Ratio is: Debt-to-Equity Ratio = Total Debt / Shareholders' Equity Debt-to-Assets Ratio The Debt-to-Assets Ratios assesses a company's total debt to its total assets. It shows how leveraged a company is and is an indication of how much the company is funded by debt relative to its assets. A lower Debt-to-Assets Ratio is favorable for the company's financial health. The formula for Debt-to-Assets Ratio is: Debt-to-Assets Ratio = Total Debt / Total Assets Interest Coverage Ratio This ratio evaluates how many times a company can cover its current interest payments with its available earnings. It represents the company's safety margin for paying its interest on debt over a specific period. A higher Interest Coverage Ratio indicates a greater capacity to service its interest obligation. The formula for Interest Coverage Ratio is: Interest Coverage Ratio = EBIT / Interest Expense

Debt-to-Assets Ratio Debt-to-Equity Ratio Equity Ratio Finance Interest Coverage Ratio Solvency Ratios

Published on October 18, 2023

CAP Theorem

The CAP Theorem is an important consideration when designing a distributed system. CAP stands for Consistency, Availability, and Partition Tolerance. The Consistency property states that all users must see the same data at the same time, regardless of the node to which they are connected. The Availability property ensures that all requests receive a response, even when some nodes are offline. Lastly, the Partition Tolerance property stipulates that a system should continue functioning despite network partitions causing communication loss between nodes. The CAP Theorem asserts that a distributed system cannot simultaneously support all three properties; it must sacrifice one to support the remaining two. This implies that there are three possible types of distributed systems: consistent-partition-tolerant (CP), availability-partition-tolerant (AP), and consistent-availability (CA). In practice, network partition is assumed to occur at some point, so distributed systems must be partition-tolerant. Since Partition Tolerance cannot be sacrificed, CA systems do not exist in practice. A CP system is consistent and partition-tolerant, where data inconsistency is unacceptable. After data is updated, the changes are replicated across all nodes with minimal delay in between. It's important to note that the consistency in CAP does not refer to strong consistency but to eventual consistency, which means that data will eventually be replicated to all nodes. This gap is generally small and considered acceptable. An AP system offers high availability and is partition-tolerant. Data inconsistency is acceptable within an AP system. The primary goal of an AP system is to ensure the successful completion of all requests. This does not necessarily mean that the requests were fulfilled correctly, as some requests may have been processed with stale data. Businesses with AP systems may need to address these discrepancies after the fact.

AP Systems Availability CAP Theorem CA Systems Consistency CP Systems Partition Tolerance Tech

Published on October 15, 2023

Liquidity Ratios

Liquidity ratio is a class of financial metrics used to measure a company's ability to pay off its short-term liabilities, usually due within a year. Another way to think about liquidity ratio is how easily and efficiently a company can convert its assets into cash so it can service its short-term debts. A higher liquidity ratio suggests that the company is able to cover its short-term obligation. A low liquidity ratio would suggest otherwise. Liquidity ratios should not be used in isolation and are best when combined with other factors. When used as internal analysis, liquidity ratios can be used by comparing prior periods to current operations, assuming the same accounting method is used in that timeframe. When used as an external analysis, liquidity ratios can be used to compare different companies within an industry. It is important to keep in mind that it may not be effective to compare the liquidity ratios between companies across different industries, sizes, or geographical locations. The three common liquidity ratios are Current Ratio, Quick Ratio, and cash ratio in the order of least to most conservative. Current Ratio Current Ratio measures a company's ability to pay off its current liabilities. This is the least conservative liquidity ratio as it includes all current assets. The formula for Current Ratio is: Current Ratio = CA / CL CA = Current Assets CL = Current Liabilities Quick Ratio Quick Ratio measures a company's ability to meet its short-term obligations with its most liquid assets. This means excluding inventory from current assets because inventory is les liquid than cash. Quick Ratio is also known as the Acid-Test Ratio. The formula for Quick Ratio is: Quick Ratio = (C + MS + AR) / CL Where C = Cash or Cash Equivalents, MS = Marketable Securities, AR = Accounts Receivable, CL = Current Liabilities Another way to look at Quick Ratio is: Quick Ratio = (CA - I - E\_prepaid) / CL Where CA = Current Assets, I = Inventory, E\_prepaid = Prepaid Expenses, CL = Current Liabilities Cash Ratio Cash Ratio is the most conservative liquidity ratio and considers only cash and cash equivalents in relation to current liabilities. The formula for Cash Ratio is: Cash Ratio = C / CL Where C = Cash or Cash Equivalents, CL = Current Liabilities

Acid-Test Ratio Cash Ratio Current Assets Current Liabilities Current Ratio Finance Liquidity Ratios Quick Ratio

Published on October 12, 2023

Exploring Different Types of Load Balancers

Load balancers are crucial components for enabling large-scale web applications. Some key features of a load balancer include distribution of traffic, health checks, session persistence, etc. There are two classes of load balancer: static and dynamic. Static load balancers are rule-based and do not adapt to changing server conditions, whereas dynamic load balancers continuously monitor server metrics and adjust the load distribution based on these metrics. Static load balancing methods include Round Robin, Sticky Round Robin, Weighted Round Robin, and IP/URL Hash. Dynamic load balancers include Least Connections and Least Time. Round Robin is the simplest approach in traffic distribution. The idea is to evenly distribute requests amongst all the servers regardless of server metrics. The down side to this approach is that server can become overloaded if they are not monitored closely. Sticky Round Robin is a variation of Round Robin. In this approach, the load balancer will attempt to send subsequent requests from the same user to the same server. Related data are grouped and processed together, improving processing performance. Uneven load can easily occur in this approach because newly arrived requests are assigned randomly. Weighted Round Robin allows users to configure the weight of each server within the server pool. The load balancer will distribute traffic to each server in proportion to its assigned weight. The downside is that server weights need to be configured manually, which can make this method less adaptable in a rapidly changing environment. IP/URL Hash load balancing is similar to Sticky Round Robin, as it maps user IP addresses or requested URLs to specific servers. Again, related data are grouped and processed together, improving processing performance. Selecting a proper hash algorithm can be a barrier to entry. Least Connections is a dynamic approach to load balancing, where incoming requests are routed to the server with the fewest active connections, effectively assigning requests to servers with the most remaining capacity. This approach requires that each server track its on going connections. Traffic can unintentionally be piled onto a single server. Least Time is another dynamic approach to load balancing, where the load balancer routes traffic to the server with the lowest latency or fastest response time. This involves continuously measuring the latency from each server, leading to increased overhead and complexity. Each of the mentioned methods has its trade-offs in terms of capabilities, constraints, and performance. It is important to consider traffic pattern when choosing and fine-tuning a load balancing approach.

Dynamic Load Balancing IP/URL Hash Least Connections Least Time Load Balancer Round Robin Static Load Balancing Sticky Round Robin Tech Weighted Round Robin

Forward Proxy and Reverse Proxy

Single Sign-on (SSO)

SAML 2.0 Authentication

Redis Use Cases

Slice and Dice a Binary Tree

Capacity Planning

Oauth 2.0 + OIDC Authentication

Evaluate an Expression

API Gateway

Exploring Different Types of Cache Systems

Convert a Sorted Array into a Binary Search Tree

Improving API Performance

The Largest Value of Each Row of a Binary Tree

Solvency Ratios

CAP Theorem

Get All Paths of a Binary Tree

Liquidity Ratios

Exploring Different Types of Load Balancers

Determine if Binary Tree Contains a Path with Given Sum

Username and Password Authentication Using JWT

Binary Tree Pre-order Traversal

TLS and SSL