Rupesh Kumar SinghMojo -a new programming languageMojo-a new programming language, 35000 times Faster than Python2 min read·May 19, 2023----
Rupesh Kumar SinghCreate increasing id column to pyspark data-framewithout using row_number(), monotonically_increasing_id create incremental id with help of using zipWithIndex() rdd function.1 min read·Apr 1, 2023----
Rupesh Kumar SinghAzure Data lake storage (ADLS) and Azure Blob Storage:Azure Blob Storage: Binary Large Object Storage use to store unstructured data such as text or binary data, you can store large amount of…2 min read·Feb 27, 2023----
Rupesh Kumar SinghAzure Subscription & Resource GroupAzure subscription: Azure subscription generally use for billing management purpose, in one Azure account you can create multiple…1 min read·Feb 27, 2023----
Rupesh Kumar Singhpyspark Pivot Examplepyspark Python code for pivot process, we take example product store information and apply pivot function.1 min read·Mar 25, 2021----
Rupesh Kumar SinghSqoop split by string columnSqoop split by string column1 min read·Sep 20, 2020----
Rupesh Kumar SinghInstallation of Hadoop single node cluster 3.1.4 in ubuntu 20.04Step 1: Installation of openJDK-82 min read·Sep 15, 2020----