DSA Interview Questions and Answers
DSA Interview Questions and Answers
DSA Interview Questions and Answers
The Data Structure is the way data is organized (stored) and manipulated for retrieval
and access. It also defines the way different sets of data relate to one another,
establishing relationships and forming algorithms.
1. Lists: A collection of related things linked to the previous or/and following data items.
3. Records: A collection of fields, each of which contains data from a single data type.
4. Trees: A data structure that organizes data in a hierarchical framework. This form of
data structure follows the ordered order of data item insertion, deletion, and
modification.
5. Tables: The data is saved in the form of rows and columns. These are comparable to
records in that the outcome or alteration of data is mirrored across the whole table.
A data structure is linear if all its elements or data items are arranged in a sequence or
a linear order. The elements are stored in a non-hierarchical way so that each item has
successors and predecessors except the first and last element in the list.
Examples of linear data structures are Arrays, Stack, Strings, Queue, and Linked List.
The difference lies in the memory area accessed. Storage structure refers to the data
structure in the memory of the computer system, whereas file structure represents the
storage structure in the auxiliary memory.
1. Row-Major Order: -In row-major ordering, all of the rows of a 2D array are stored in
memory in a contiguous manner.
First, the first row of the array is entirely stored in memory, followed by the second row
of the array, and so on until the final row.
This is one of the most frequently asked data structure interview questions where the
interviewer expects you to give a thorough answer. Try to explain as much as possible
rather than finishing your answer in a sentence!
It’s a linear Data Structure or a sequence of data objects where elements are not stored
in adjacent memory locations. The elements are linked using pointers to form a chain.
Each element is a separate object, called a node. Each node has two items: a data
field and a reference to the next node. The entry point in a linked list is called the head.
Where the list is empty, the head is a null reference and the last node has a reference
to null.
A linked list is a dynamic data structure, where the number of nodes is not fixed, and the
list has the ability to grow and shrink on demand.
We deal with an unknown number of objects or don’t know how many items are in the list
The algorithm requires a data structure where objects need to be stored irrespective of their
physical address in memory
Some implementations are stacks and queues, graphs, directory of names, dynamic
memory allocation, and performing arithmetic operations on long integers.
Linked lists are considered both linear and non-linear data structures depending upon
the application they are used for. When used for access strategies, it is considered as a
linear data-structure. When used for data storage, it is considered a non-linear data
structure.
10. What are the advantages of a linked list over an array? In which scenarios
do we use Linked List and when Array?
Insertion and deletion of nodes is an easier process, as we only update the address
present in the next pointer of a node. It’s expensive to do the same in an array as the
room has to be created for the new elements and existing elements must be shifted.
As a linked list is a dynamic data structure, there is no need to give an initial size as it
can grow and shrink at runtime by allocating and deallocating memory. However, the
size is limited in an array as the number of elements is statically stored in the main
memory.
3. No Wastage of Memory
As the size of a linked list can increase or decrease depending on the demands of the
program, and memory is allocated only when required, there is no memory wasted. In
the case of an array, there is memory wastage. For instance, if we declare an array of
size 10 and store only five elements in it, then the space for five elements is wasted.
4. Implementation
Data structures like stack and queues are more easily implemented using a linked list
than an array.
When we want to insert items in the middle of the list, such as when implementing a priority
queue
Some scenarios in which we use array over the linked list are:
When we know the number of elements in the array beforehand, so we can allocate the
correct amount of memory
When we need speed when iterating through all the elements in the sequence
When memory is a concern; filled arrays use less memory than linked lists, as each element
in the array is the data but each linked list node requires the data as well as one or more
pointers to the other elements in the linked list
It is a complex type (double-ended LL) of a linked list in which a node has two links, one
that connects to the next node in the sequence and another that connects to the
previous node. This allows traversal across the data elements in both directions.
Examples include:
The undo and redo functionality on a browser, where you can reverse the node to get to the
previous page
Using an indexed loop, we may access all of the elements in a one-dimensional array.
The counter counts down from 0 to the maximum array size, n, minus one. The loop
counter is used as the array subscript to refer to all items of the one-dimensional array
in succession.
They are collections of data in memory that expand and contract to grow or shrink in
size as a program runs. This enables the programmer to control exactly how much
memory is to be utilized.
Examples are the dynamic array, linked list, stack, queue, and heap.
A problem can be solved in more than one way using several solution algorithms.
Algorithm analysis provides an estimation of the required resources of an algorithm to
solve a specific computational problem. The amount of time and space resources
required to execute is also determined.
The time complexity of an algorithm quantifies the amount of time taken for an algorithm
to run as a function of the length of the input. The space complexity quantifies the
amount of space or memory taken by an algorithm, to run as a function of the length of
the input.
A stack is an abstract data type that specifies a linear data structure, as in a real
physical stack or piles where you can only take the top item off the stack in order to
remove things. Thus, insertion (push) and deletion (pop) of items take place only at one
end called top of the stack, with a particular order: LIFO (Last In First Out) or FILO (First
In Last Out).
Syntax parsing
String reversal
Parenthesis checking
Backtracking
A stack is a linear data structure that operates on the same concept, in that components
in a stack are added and deleted only from one end, referred to as the TOP. As a result,
a stack is known as a LIFO (Last-In-First-Out) data structure because the piece that
was put last is the first to be removed.
1. PUSH: The push action inserts a new element into the stack. The new feature is placed at
the top of the stack. However, before inserting the value, we must first verify if TOP=MAX–1,
since if so, the stack is filled, and no more insertions are possible. An OVERFLOW message
is printed if an attempt is made to put a value into an existing stack.
2. POP: The pop operation is performed to remove the stack's topmost element. However,
before removing the value, we must first verify if TOP=NULL, since if it is, the stack is empty,
and no further deletions are permitted. An UNDERFLOW notice is produced if an attempt is
made to erase a value from a stack that is already empty.
3. PEEK: A peek action returns the value of the stack's topmost element without removing it
from the stack. On the other hand, the Peek operation first checks if the stack is empty, i.e., if
TOP = NULL, then an appropriate message is written. Otherwise, a value is returned.
A postfix expression is made up of operators and operands, with the operator coming
after the operands. That is, in a postfix expression, the operator comes after the
operands. Likewise, what is the proper postfix form? The correct postfix phrase is A B +
C *.
In this data structure interview question, you can also discuss your experience and
situations using queue. A queue is an abstract data type that specifies a linear data
structure or an ordered list, using the First In First Out (FIFO) operation to access
elements. Insert operations can be performed only at one end called REAR and delete
operations can be performed only at the other end called FRONT.
As waiting lists for a single shared resource in a printer, CPU, call center systems, or image
uploads; where the first one entered is the first to be processed
In the asynchronous transfer of data; or example pipes, file IO, and sockets
To maintain the playlist in media players (to add or remove the songs)
The front is used to get the value of the first data item but does not remove it
In this data structure interview questions, try giving various advantages, along with
examples, if possible. It will show the interviewer your domain expertise. Generally,
both heap and stack are part of memory and used in Java for different needs:
Heap is more flexible than the stack because memory space can be dynamically
allocated and de-allocated as needed
Heap memory is used to store objects in Java, whereas stack memory is used to
store local variables and function call
Objects created in the heap are visible to all threads, whereas variables stored in
stacks are only visible to the owner as private memory
When using recursion, the size of heap memory is more whereas it quickly fill-ups
stack memory
Expression evaluation
Backtracking
Memory management
The acronyms stand for Pushing and Popping operations performed on a stack. These
are ways data is stored and retrieved.
PUSH is used to add an item to a stack, while POP is used to remove an item.
PUSH takes two arguments, the name of the stack to add the data to and the value
of the entry to be added. POP only needs the name of the stack.
When the stack is filled and another PUSH command is issued, you get a stack
overflow error, which means that the stack can no longer accommodate the last
PUSH. In POP, a stack underflow error occurs when you’re trying to POP an already
empty stack.
A single sorting algorithm can’t be considered best, as each algorithm is designed for a
particular data structure and data set. However, the QuickSort algorithm is generally
considered the fastest because it has the best performance for most inputs.
Cache-efficient: It linearly scans and linearly partitions the input. This means we can
make the most of every cache load.
Can skip some swaps: As QuickSort is slightly sensitive to input that is in the right
order, it can skip some swaps.
Merge sort is a divide-and-conquer algorithm for sorting the data. It works by merging
and sorting adjacent data to create bigger sorted lists, which are then merged
recursively to form even bigger sorted lists until you have one single sorted list.
Selection sort works by repeatedly picking the smallest number in ascending order from
the list and placing it at the beginning. This process is repeated moving toward the end
of the list or sorted subarray.
Scan all items and find the smallest. Switch over the position as the first item. Repeat
the selection sort on the remaining N-1 items. We always iterate forward (i from 0 to N-
1) and swap with the smallest element (always i).
performance." The purpose is to identify the best case, worst case, and average-case
times for completing a particular activity. While not a deep learning training technique,
Asymptotic analysis is an essential diagnostic tool for programmers to analyze an
algorithm's efficiency rather than its correctness.
Quicksort is the name of a sorting algorithm. The method selects a pivot element and
rearranges the array elements so that all items less than the pivot chosen element go to
the left side of the pivot and all elements more significant than the pivot element move
to the right side.
Merge Sort is a sorting algorithm as well. The algorithm divides the array into two
halves, sorts them recursively, and then combines the two sorted halves. The goal of
points that are closest together is to identify the nearest pair of points in an x-y plane
collection of points. The issue may be solved in O(n2) time by computing the distances
between each pair of locations and comparing them to determine the shortest distance.
Transport grids where stations are represented as vertices and routes as the edges
of the graph
Utility graphs of power or water, where vertices are connection points and edge the
wires or pipes connecting them
Social network graphs to determine the flow of information and hotspots (edges and
vertices)
Neural networks where vertices represent neurons and edge the synapses between
them
A tree is referred to as a generic tree if its hierarchy is not constrained. In the General
Tree, each node can have an endless number of offspring, and all other trees are
subsets of the tree.
The binary tree is a type of tree in which each parent has at least two offspring. The
children are referred to as the left and right youngsters. This tree is more popular than
most others. When specific limitations and features are given to a Binary tree, various
trees such as AVL tree, BST (Binary Search Tree), RBT tree, and so on are also
utilized.
Binary Search Tree (BST) is a binary tree extension that includes numerous optional
constraints. In BST, a node's left child value should be less than or equal to the parent
value, while the correct child value should always be higher than or equal to the parent's
value.
The AVL tree is a self-balancing binary search tree. The term AVL is given in honor of
the inventors Adelson-Velshi and Landis. This was the first tree to achieve dynamic
equilibrium. Each node in the AVL tree is assigned a balancing factor based on whether
the tree is balanced or not. The node kids have a maximum height of one AVL vine.
Red-black trees are another type of auto-balancing tree. The red-black term is derived
from the qualities of the red-black tree, which has either red or black painted on each
node. It helps to keep the forest in balance. Even though this tree is not perfectly
balanced, the searching process takes just O (log n) time.
In this sort of tree with a node, N is the maximum number of children. A binary tree is a
two-year tree since each binary tree node has no more than two offspring. A full N-ary
tree is one in which the children of each node are either 0 or N.
A binary tree is a tree data structure made up of nodes, each of which has two offspring,
known as the left and right nodes. The tree begins with a single node called the root.
Data
37. What are the differences between the B tree and the B+ tree?
The B tree is a self-balancing m-way tree, with m defining the tree's order. Depending
on the number of m, Btree is an extension of the Binary Search tree in which a node
can have more than one key and more than two children. The data is provided in the B
tree in a sorted manner, with lower values on the left subtree and higher values on the
right subtree.
The B+ tree is an advanced self-balanced tree since every path from the tree's root to
its leaf is the same length. The fact that all leaf nodes are the same length indicates that
they all occur at the same level. Specific leaf nodes can’t appear at the third level, while
others appear at the second level.
38. What are the advantages of binary search over a linear search?
In a sorted list:
A binary search is more efficient than a linear search because we perform fewer
comparisons. With linear search, we can only eliminate one element per comparison
each time we fail to find the value we are looking for, but with the binary search, we
eliminate half the set with each comparison.
Binary search runs in O(log n) time compared to linear search’s O(n) time. This
means that the more of the elements present in the search array, the faster is binary
search compared to a linear search.
An AVL (Adelson, Velskii, and Landi) tree is a height balancing binary search tree in
which the difference of heights of the left and right subtrees of any node is less than or
equal to one. This controls the height of the binary search tree by not letting it get
skewed. This is used when working with a large data set, with continual pruning through
insertion and deletion of data.
Null indicates an empty value for a variable, whereas void indicates pointers that
have no initial size
Null means it never existed; Void means it existed but is not in effect
Dynamic memory allocation stores simple structured data types at runtime. It has the
ability to combine separately allocated structured blocks to form composite structures
that expand and contract as needed, thus helping manage data of data blocks of
arbitrary size, in arbitrary order.
42. Name the ways to determine whether a linked list has a loop.
Using hashing
Using the visited nodes method (with or without modifying the basic linked list data
structure)
Sparse matrix
Index generation
It is an array whose elements themselves are arrays and may be of different dimensions
and sizes.
It is a type of heap data structure where the value of the root node is greater than or
equal to either of its child nodes.
The height of the node equals the number of edges in the longest path to the leaf from
the node, where the depth of a leaf node is 0.