Open in app

Sign In

Write

Sign In

Aviral Srivastava
Aviral Srivastava

174 Followers

Home

About

Mar 19, 2021

Understanding UNIX

I wanted to understand how come UNIX is so standardized and developer-friendly. Before engineers at BELL Labs even thought UNIX, the computer industry suffered unproductivity in large-scale projects: too many people, too many inefficiencies, delay, and improper planning. UNIX made computers more productive. Unlike hardware, people demand new or modified…

Unix

6 min read

Unix

6 min read


Aug 28, 2020

Working as an intern at Wingify

This blog post will cover my internship experience at Wingify from Dec ’17 to April ’18. The three reasons I am writing this blog post: A record for myself so that I always value my energy. To attempt providing value to others. Paras Chopra did not forget my promise of…

Internships

11 min read

Internships

11 min read


Jun 29, 2020

Run a Stateful Streaming Service with Apache Flink and RocksDB

What is Flink? Apache Flink is an open-source distributed system platform that performs data processing in stream and batch modes. Being a distributed system, Flink provides fault tolerance for the data streams. What is Stateful Streaming Service? To make our platform fault tolerance, we want to preserve the system’s state from time to time. …

Apache

5 min read

Run a Stateful Streaming Service with Apache Flink and RocksDB
Run a Stateful Streaming Service with Apache Flink and RocksDB
Apache

5 min read


May 13, 2020

Understanding Naiad: A timely dataflow system for batch and stream processing

A large scale distributed system that can support cyclic dataflows. Back in 2013 and even now, every distributed programming model was tightly coupled with the engine that executed this model and so interoperability was an issue. It is an issue even today but not that much — thanks to systems…

Distributed Systems

4 min read

Understanding Naiad: A timely dataflow system for batch and stream processing
Understanding Naiad: A timely dataflow system for batch and stream processing
Distributed Systems

4 min read


Apr 23, 2020

MongoDB from fundamentals

After working on MongoDB for about six years, I am sharing certain practices that have worked well for me. Introduction to MongoDB In relational database designs, the schema is statically defined. In document-based databases such as MongoDB, the schema is dynamic and is based on the document structure. MongoDB is a schemaless database. Designing MongoDB Document Stores …

Mongodb

3 min read

Mongodb

3 min read


Feb 19, 2020

Everything you ever wanted to know about Zookeeper

This blog post will discuss the research paper, Zookeeper: Wait-free coordination for Internet-scale systems. Click here for downloading the paper. What problem does Zookeeper solve? Coordination is of utmost importance in distributed applications. While building distributed systems, we need primitives like distributed lock services, shared registers, group messaging, etc. …

Zookeeper

7 min read

Everything you ever wanted to know about Zookeeper
Everything you ever wanted to know about Zookeeper
Zookeeper

7 min read


Feb 6, 2020

Understanding Merge and Sort Algorithms on Modern Processors

This article explains the research paper, “Patience is a Virtue: Revisiting Merge and Sort on Modern Processors” published by Microsoft Research. I intend to explain the research paper in a simple manner. The reason I chose this algorithm is due to its application scope — infrastructure monitoring and historical analysis…

Microsoft

10 min read

Understanding Merge and Sort Algorithms on Modern Processors
Understanding Merge and Sort Algorithms on Modern Processors
Microsoft

10 min read


Jan 8, 2020

Understanding Rules of Thumb for Computer Storage

A blog post that explains the research paper: The Five-Minute Rule Ten Years Later, and Other Computer Storage Rules of Thumb While working as a data engineer, I often encounter the problem of data storage: whether data access should be kept faster or should I go for cold-storage or something…

Data

4 min read

Understanding Rules of Thumb for Computer Storage
Understanding Rules of Thumb for Computer Storage
Data

4 min read


Published in Towards Data Science

·Dec 11, 2019

Dimensional Data Modeling

Why do you need dimensional data modeling and how to implement it? — What is dimensional data modeling? Dimensional modeling (DM) is part of the Business Dimensional Lifecycle methodology developed by Ralph Kimball which includes a set of methods, techniques and concepts for use in data warehouse design. The approach focuses on identifying the key business processes within a business and modelling and implementing these first before adding…

Analytics

7 min read

Dimensional Data Modeling
Dimensional Data Modeling
Analytics

7 min read


Dec 8, 2019

Big Data Fundamentals needed for a Data Engineer

Working as a Data Engineer, you are directly connected to your data science teams and some senior-level personnel who give you business use-cases. You have to be involved in the complete process to gain awareness of why a particular action should be taken. …

Big Data

11 min read

Big Data Fundamentals needed for a Data Engineer
Big Data Fundamentals needed for a Data Engineer
Big Data

11 min read

Aviral Srivastava

Aviral Srivastava

174 Followers

Code + Data.

Following
  • Richard Red

    Richard Red

  • Terri Mead

    Terri Mead

  • Markham Heid

    Markham Heid

  • Lak Lakshmanan

    Lak Lakshmanan

  • Crosswire

    Crosswire

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech