{"version":1,"pages":[{"id":"D71mcfZ2Vglavnjy2aWH","title":"About","pathname":"/","siteSpaceId":"sitesp_Z1YgM","description":"This page tells you what our vision and intention for this book is and how you can help in making it better."},{"id":"JwVXZ7oImKjjHR33IBVa","title":"Log","pathname":"/log","siteSpaceId":"sitesp_Z1YgM","description":"The journey of the book so far"},{"id":"OpsSnNBEDlPVEZBdE76O","title":"Mathematical Motivation","pathname":"/mathematical-motivation","siteSpaceId":"sitesp_Z1YgM","description":"This page contains a preliminary discussion into what the different mathematical concepts are and how they relate to data science."},{"id":"I8xVmy2h0Kw7oniCTXly","title":"Probability Basics","pathname":"/statistics/probability-basics","siteSpaceId":"sitesp_Z1YgM","description":"Probability theory is the mathematical foundation of statistical inference, which is indispensable for analyzing data affected by chance, and thus essential for data scientists.","breadcrumbs":[{"label":"STATISTICS"}]},{"id":"lmoZ4V3NFskP5gtnWSO5","title":"Probability Distribution","pathname":"/statistics/probability-distribution","siteSpaceId":"sitesp_Z1YgM","description":"Knowing the distribution of data helps us better model the world around us. It helps us to determine the likeliness of various outcomes or make an estimate of the variability of an occurrence.","breadcrumbs":[{"label":"STATISTICS"}]},{"id":"D0NIXYU3t5LWKiDXoWt6","title":"Central Limit Theorem","pathname":"/statistics/central-limit-theorem","siteSpaceId":"sitesp_Z1YgM","description":"The theorem gives us the ability to quantify the likelihood that our sample will deviate from the population without having to take any new sample to compare it with.","breadcrumbs":[{"label":"STATISTICS"}]},{"id":"Jr8VPYLb21ONABqIMY78","title":"Bayesian vs Frequentist Reasoning","pathname":"/statistics/bayesian-vs-frequentist-reasoning","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"STATISTICS"}]},{"id":"DkwmnceaoY96Rr2ipZCY","title":"Hypothesis Testing","pathname":"/statistics/hypothesis-testing","siteSpaceId":"sitesp_Z1YgM","description":"Hypothesis testing is the process used to evaluate the strength of evidence from the sample and provides a framework for making determinations related to the population","breadcrumbs":[{"label":"STATISTICS"}]},{"id":"KTSVj5oS4EtoAcRmyd4U","title":"A/B test","pathname":"/statistics/a-b-test","siteSpaceId":"sitesp_Z1YgM","emoji":"26a0","breadcrumbs":[{"label":"STATISTICS"}]},{"id":"6S0WyBv85zf1zHLvIA6W","title":"Overview","pathname":"/model-building/overview","siteSpaceId":"sitesp_Z1YgM","description":"This page broadly summarizes the steps needed to go from data gathering to model building","breadcrumbs":[{"label":"MODEL BUILDING"}]},{"id":"uVE0bMR18cMlPsZidt7l","title":"Data","pathname":"/model-building/data","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"MODEL BUILDING"}]},{"id":"GlA5EqiXg6LUJxBsxFWz","title":"Scaling","pathname":"/model-building/data/scaling","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"MODEL BUILDING"},{"label":"Data"}]},{"id":"zKJFQE0V1TEq6FXtR1Mq","title":"Missing Value","pathname":"/model-building/data/missing-value","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"MODEL BUILDING"},{"label":"Data"}]},{"id":"9hJtWOzDmMltEZP1S2fZ","title":"Outlier","pathname":"/model-building/data/outlier","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"MODEL BUILDING"},{"label":"Data"}]},{"id":"M4D0miThopRZ4Vxk2WmI","title":"Sampling","pathname":"/model-building/data/sampling","siteSpaceId":"sitesp_Z1YgM","emoji":"26a0","description":"","breadcrumbs":[{"label":"MODEL BUILDING"},{"label":"Data"}]},{"id":"XeyIxGfSX0yHGbPd2xzc","title":"Categorical Variable","pathname":"/model-building/data/categorical-variable","siteSpaceId":"sitesp_Z1YgM","description":"In the realm of data analysis, categorical variables play a vital role in representing non-numeric data. To utilize these variables effectively it is essential to convert them into numerical form.","breadcrumbs":[{"label":"MODEL BUILDING"},{"label":"Data"}]},{"id":"fP7yG6vlp2nbq0nadA4H","title":"Hyperparameter Optimization","pathname":"/model-building/hyperparameter-optimization","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"MODEL BUILDING"}]},{"id":"3K2ScQ1FrhHxNS8EcPjC","title":"Overview","pathname":"/algorithms/overview","siteSpaceId":"sitesp_Z1YgM","description":"This page discusses the building blocks of an algorithm.","breadcrumbs":[{"label":"Algorithms"}]},{"id":"1GbzbyQobAMv4SaheJCJ","title":"Bias/Variance Tradeoff","pathname":"/algorithms/bias-variance-tradeoff","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"Algorithms"}]},{"id":"XR36IyerY7tY8UWK4WIl","title":"Regression","pathname":"/algorithms/regression","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"Algorithms"}]},{"id":"xwXVHC9ckqcHvClvlehj","title":"Generative vs Discriminative Models","pathname":"/algorithms/generative-vs-discriminative-models","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"Algorithms"}]},{"id":"zExkmwFdeaVpW05agcgq","title":"Classification","pathname":"/algorithms/classification","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"Algorithms"}]},{"id":"vQJHgH6t4cq555zqDco7","title":"Clustering","pathname":"/algorithms/clustering","siteSpaceId":"sitesp_Z1YgM","emoji":"26a0","breadcrumbs":[{"label":"Algorithms"}]},{"id":"rDyncuvET656Xd9GoYOt","title":"Tree based approaches","pathname":"/algorithms/tree-based-approaches","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"Algorithms"}]},{"id":"Uc3q2gCFcj9VgauFbjQg","title":"Time Series Analysis","pathname":"/algorithms/time-series-analysis","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"Algorithms"}]},{"id":"kz4ZaaTFZqyEDoshQCTf","title":"Anomaly Detection","pathname":"/algorithms/anomaly-detection","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"Algorithms"}]},{"id":"h6qH4de3DHTVqiUPIxFo","title":"Big O","pathname":"/algorithms/big-o","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"Algorithms"}]},{"id":"OzkcIl1Z60Xk4bJpAgck","title":"Neural Network","pathname":"/neural-network/neural-network","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"NEURAL NETWORK"}]},{"id":"y2IiaRL2StC59yHxJ637","title":"Recurrent Neural Network","pathname":"/neural-network/recurrent-neural-network","siteSpaceId":"sitesp_Z1YgM","emoji":"26a0","description":"","breadcrumbs":[{"label":"NEURAL NETWORK"}]},{"id":"mcA0eD0Z1SSfYkcxWVFC","title":"Lexical Processing","pathname":"/nlp/lexical-processing","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"NLP"}]},{"id":"MnLNQBj832yn7vi2MEUm","title":"Syntactic Processing","pathname":"/nlp/syntactic-processing","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"NLP"}]},{"id":"IXdTqUGa55dYbT6dAtzd","title":"Transformers","pathname":"/nlp/transformers","siteSpaceId":"sitesp_Z1YgM","description":"A summary of transformers and why it makes it important in Data Science","breadcrumbs":[{"label":"NLP"}]},{"id":"ZXjGEPYLbUmaOhkAMQA8","title":"Power BI","pathname":"/business-intelligence/power-bi","siteSpaceId":"sitesp_Z1YgM","emoji":"26a0","description":"Overview of Power BI and its core components.","breadcrumbs":[{"label":"BUSINESS INTELLIGENCE"}]},{"id":"r8hTdXiriNFuqVOvpMId","title":"Charts","pathname":"/business-intelligence/power-bi/charts","siteSpaceId":"sitesp_Z1YgM","description":"Common Power BI charts and when to use them.","breadcrumbs":[{"label":"BUSINESS INTELLIGENCE"},{"label":"Power BI","emoji":"26a0"}]},{"id":"W5UShENKCEbirY2NAaSE","title":"Problems","pathname":"/business-intelligence/power-bi/problems","siteSpaceId":"sitesp_Z1YgM","breadcrumbs":[{"label":"BUSINESS INTELLIGENCE"},{"label":"Power BI","emoji":"26a0"}]},{"id":"37m8FBGCEa7svKtB5Bux","title":"Visualization","pathname":"/business-intelligence/visualization","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"BUSINESS INTELLIGENCE"}]},{"id":"0EfzfWlBLDFVPJVzPYB3","title":"Theoretical","pathname":"/python/theoretical","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"PYTHON"}]},{"id":"PPsulzktL7y6UJUCwoMQ","title":"Basics","pathname":"/python/basics","siteSpaceId":"sitesp_Z1YgM","description":"This page deals with Basic Python Questions","breadcrumbs":[{"label":"PYTHON"}]},{"id":"r8SQaPBfS40ldtTLsQTt","title":"Data Manipulation","pathname":"/python/data-manipulation","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"PYTHON"}]},{"id":"l8jZNlPVkFvBbmYTOAdI","title":"Statistics","pathname":"/python/statistics","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"PYTHON"}]},{"id":"69T2jf7CBt0rgIPt5hFL","title":"NLP","pathname":"/python/nlp","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"PYTHON"}]},{"id":"DkDvdIWphMb1uGadteoz","title":"Algorithms from scratch","pathname":"/python/algorithms-from-scratch","siteSpaceId":"sitesp_Z1YgM","description":"Often companies ask to code different Algorithms from scratch as a part of their craft demo round.","breadcrumbs":[{"label":"PYTHON"}]},{"id":"M99DFyYiOGbkx1m8xHX6","title":"Linear Regression","pathname":"/python/algorithms-from-scratch/linear-regression","siteSpaceId":"sitesp_Z1YgM","breadcrumbs":[{"label":"PYTHON"},{"label":"Algorithms from scratch"}]},{"id":"kkBVHX73wfjGtceEP5jJ","title":"Logistic Regression","pathname":"/python/algorithms-from-scratch/logistic-regression","siteSpaceId":"sitesp_Z1YgM","breadcrumbs":[{"label":"PYTHON"},{"label":"Algorithms from scratch"}]},{"id":"E7z2w1hQaTuPQenX9IAS","title":"PySpark","pathname":"/python/pyspark","siteSpaceId":"sitesp_Z1YgM","description":"A brief overview of PySpark","breadcrumbs":[{"label":"PYTHON"}]},{"id":"MjDc8XlznCTcjSE0fbae","title":"Overview","pathname":"/ml-ops/overview","siteSpaceId":"sitesp_Z1YgM","breadcrumbs":[{"label":"ML OPS"}]},{"id":"AqEKuNTwRtvOX19MnKRJ","title":"GIT","pathname":"/ml-ops/git","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"ML OPS"}]},{"id":"kYAnYV0xo28LdDBAdEMP","title":"Feature Store","pathname":"/ml-ops/feature-store","siteSpaceId":"sitesp_Z1YgM","breadcrumbs":[{"label":"ML OPS"}]},{"id":"afHHkp8NWOhFsCOBCTpd","title":"Basics","pathname":"/sql/basics","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"SQL"}]},{"id":"1mHt4AzmqMz28sjt8J92","title":"Joins","pathname":"/sql/joins","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"SQL"}]},{"id":"bLDKETqFPdyNaKEmXxVI","title":"Temporary Datasets","pathname":"/sql/temporary-datasets","siteSpaceId":"sitesp_Z1YgM","description":"A summary of Temp Table vs Table variable vs CTE","breadcrumbs":[{"label":"SQL"}]},{"id":"EuaZ1idxNUDpvomT0la4","title":"Windows Functions","pathname":"/sql/windows-functions","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"SQL"}]},{"id":"KGpsVmTMtX8kCgO2OagE","title":"Time","pathname":"/sql/time","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"SQL"}]},{"id":"UeIpT5ROSZBgZd0otkYm","title":"Functions & Stored Proc","pathname":"/sql/functions","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"SQL"}]},{"id":"yatSesAF6zUXOJSWgM1A","title":"Index","pathname":"/sql/index","siteSpaceId":"sitesp_Z1YgM","breadcrumbs":[{"label":"SQL"}]},{"id":"RAkCIpq4bttz5lHA2Oud","title":"Performance Tuning","pathname":"/sql/performance-tuning","siteSpaceId":"sitesp_Z1YgM","breadcrumbs":[{"label":"SQL"}]},{"id":"Navx9yY3SPvrbzSbXqLk","title":"Problems","pathname":"/sql/problems","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"SQL"}]},{"id":"HAY7kL8zZdTBnihpUdBN","title":"Excel Basics","pathname":"/excel/excel-basics","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"EXCEL","emoji":"26a0"}]},{"id":"R4fPL9rw0Q1fU7hHkIM8","title":"Data Manipulation","pathname":"/excel/data-manipulation","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"EXCEL","emoji":"26a0"}]},{"id":"cUagChg4QCNjhUdQjepU","title":"Time and Date","pathname":"/excel/time-and-date","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"EXCEL","emoji":"26a0"}]},{"id":"RdOX22LV4C2CGn0TAH44","title":"Python in Excel","pathname":"/excel/python-in-excel","siteSpaceId":"sitesp_Z1YgM","description":"Anaconda and Microsoft announced a groundbreaking innovation: Python in Excel. This marks a transformation in how Excel users and Python practitioners approach their work.","breadcrumbs":[{"label":"EXCEL","emoji":"26a0"}]},{"id":"e9FKwHUWbVHDfLf3lFWG","title":"PyCaret","pathname":"/machine-learning-frameworks/pycaret","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"MACHINE LEARNING FRAMEWORKS"}]},{"id":"kSG1RKqQLG51PDpUe4B8","title":"Tensorflow","pathname":"/machine-learning-frameworks/tensorflow","siteSpaceId":"sitesp_Z1YgM","emoji":"26a0","description":"","breadcrumbs":[{"label":"MACHINE LEARNING FRAMEWORKS"}]},{"id":"VX4Mq8sAeit3FH50lxgA","title":"Business Scenarios","pathname":"/analytical-thinking/business-scenarios","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"ANALYTICAL THINKING"}]},{"id":"h7hB5jXohYYKtmBOZhJv","title":"Industry Application","pathname":"/analytical-thinking/industry-application","siteSpaceId":"sitesp_Z1YgM","emoji":"26a0","description":"","breadcrumbs":[{"label":"ANALYTICAL THINKING"}]},{"id":"v9UzEvcPn1cNuPwBybmb","title":"Behavioral/Management","pathname":"/analytical-thinking/behavioral-management","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"ANALYTICAL THINKING"}]},{"id":"EkAdfjSUpRGmhPT5gRU9","title":"Vector Database","pathname":"/generative-ai/vector-database","siteSpaceId":"sitesp_Z1YgM","description":"With the rise of Foundational Models, Vector Databases skyrocketed in popularity. Vector Database is also useful outside of a Large Language Model context.","breadcrumbs":[{"label":"Generative AI"}]},{"id":"ZAuIwQWP7c8mhUXxmYL0","title":"LLMs","pathname":"/generative-ai/llms","siteSpaceId":"sitesp_Z1YgM","description":"An overview of large language models (LLMs), covering LLM prompting, LLM fine-tuning, and LLM application development.","breadcrumbs":[{"label":"Generative AI"}]},{"id":"6oDcWrZX4EoSWM5NEOl6","title":"NumPy","pathname":"/cheat-sheets/numpy","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"CHEAT SHEETS"}]},{"id":"FnB5nTeLqvBzgDAgO6g9","title":"Pandas","pathname":"/cheat-sheets/pandas","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"CHEAT SHEETS"}]},{"id":"V12UcJMbr5V7jYmCWslY","title":"Pyspark","pathname":"/cheat-sheets/pyspark","siteSpaceId":"sitesp_Z1YgM","breadcrumbs":[{"label":"CHEAT SHEETS"}]},{"id":"XF6yFbc7TIydNZgQ8XQe","title":"SQL","pathname":"/cheat-sheets/sql","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"CHEAT SHEETS"}]},{"id":"rkHKt4MDfEt9WzmIwKOG","title":"Statistics","pathname":"/cheat-sheets/statistics","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"CHEAT SHEETS"}]},{"id":"hAL1Gz1FtLUKngbxHgRR","title":"RegEx","pathname":"/cheat-sheets/regex","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"CHEAT SHEETS"}]},{"id":"Rl5hOxiWJCqeJ9CDpBcj","title":"Git","pathname":"/cheat-sheets/git","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"CHEAT SHEETS"}]},{"id":"L4ymHhUueHPhUztONaI6","title":"Power BI","pathname":"/cheat-sheets/power-bi","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"CHEAT SHEETS"}]},{"id":"spWrz90HQle6qyeMVZ3l","title":"Python Basics","pathname":"/cheat-sheets/python-basics","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"CHEAT SHEETS"}]},{"id":"fzjrmhYHCxKXB2lgvLoE","title":"Keras","pathname":"/cheat-sheets/keras","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"CHEAT SHEETS"}]},{"id":"01qEyPDXO5NckJl2rYYe","title":"R Basics","pathname":"/cheat-sheets/r-basics","siteSpaceId":"sitesp_Z1YgM","description":"","breadcrumbs":[{"label":"CHEAT SHEETS"}]},{"id":"JXDeWU0px0HGaD7ONPgY","title":"PRIVACY NOTICE","pathname":"/policies/privacy-notice","siteSpaceId":"sitesp_Z1YgM","description":"The short version is that we do not collect personal information. We use Google analytics for things like country or device our users are accessing from & which pages they are visiting. That's all.","breadcrumbs":[{"label":"POLICIES"}]}]}