admin – Shivdeep Nancherla

Toastmasters Club Elections

When I became the President of the Unisys Ritoras Toastmasters club last term, I did not know what to expect. Six months down the line, I must say I am glad I took the plunge! Leaders are not born as popular opinion states, Leaders are made. Opportunities to practice becoming a leader are everywhere, but the ability to practice without causing major disruption is what Toastmasters provides. Just as Toastmasters meetings provide a mutually supportive and positive learning environment where one is not judged, being an office bearer promises to provide an ecosystem to hone ones leadership skills. Being a Leader of a club has given me ample opportunities to learn from difficult situations, and I am getting better with each and every engagement.

The meeting is the stage where you learn speaking, backstage is where you learn leadership.

Skills I continue to sharpen in the process are small group leadership, problem solving, conflict management, compliance with procedures, organizing groups to accomplish tasks/ events, providing tactful and constructive feedback, etc. If you are looking for becoming a better leader, the opportunity is now!

Roles one can pickup in Toastmasters are

President

The President presides at meetings of the Club, has general supervision of the operations of the Club. Serves as one of the Club’s representatives on Area and District Councils.

Vice President Education

Plans and directs club programs which meet the educational needs of the Club members. Plans and publishes regular schedules of meeting assignments. Keeps track of member’s progress towards goals. Serves as one of the Club’s representatives on Area and District Councils.

Vice President Membership

Plans and directs programs to retain and increase club membership. Serves as one of the Club’s representatives on Area and District Councils

Vice President Public Relations

Develops and directs programs that inform individual members and the general public about Toastmasters International and about Club activities.

Secretary

The Secretary is responsible for Club records and correspondence. Maintains the club roster. Has custody of the Club’s charter, Constitution, Bylaws, and all other records and documents of the club. Keeps an accurate record of the meetings and activities of the Club.

Treasurer

The Treasurer is responsible for Club financial policies, procedures and controls. Collects dues and pays dues to Toastmasters International, and maintains records. Makes financial reports to the Club at least quarterly. Receives and disburses, with approval of the Club, all Club funds.

Sergeant at Arms

Prepares meeting room for meeting. Maintains Club property, including banner, nametags, and supplies. Greets visitors. Chairs Social and Reception Committees.

Additional information can be found at the Toastmasters website –

https://www.toastmasters.org/leadership-central/club-officer-tools/club-management/club-quality/club-officer-elections

https://www.toastmasters.org/Leadership-Central/Governing-Documents

AWS and How I got myself AWS Cloud Practitioner Certified

It has been a while since I completed this certification. Well, this deserved a page on its own and here it is.
shivdeep-aws-cp-certified

The reason why you would want to certify yourself on one cloud vs the other might range on a variety of factors from what you work on, you inclination towards one cloud vs the other, your future career path etc. If you ask me, it would be nice to have fundamental certifications in more than one Cloud. Helps understand things from a different perspective.

Certification in AWS

As of 2021, AWS comprises of 200 products and services including computing, storage, networking, database, analytics, application services, deployment, management, machine learning, mobile, developer tools, and tools for the Internet of Things.

Certification in AWS shows that you have the skills and knowledge to design, deploy and manage applications on Amazon Web Services. The process of training and learning required to pass the exams allows one to solidify principles and strengthen your knowledge. AWS introduced certifications in 2013 and it currently offers 12 certifications that cover both foundational and specialty cloud computing topics.

AWS Cloud Practitioner

This is a certification which is aimed at individuals who are looking to validate their overall understanding of the AWS Cloud. It does not necessarily have to be technical folks. It is useful for individuals in technical, managerial, sales, purchasing or even financial roles who work with the AWS Cloud. The break up of the questions in the exam is based on the below criterion –

Cloud Concepts (28%)
Security (24%)
Technology (36%)
Billing and Pricing (12%)

How I Prepared

Consistently dedicated an hour a day for around 3 weeks. (spent more that than nearing the exam date)

Created notes and revised.

Looked up on complete courses on YouTube (Free videos by Andrew Brown is highly recommended)

Strategy for the Exam

Don’t spend a lot of time on each question, if you are not too clear, mark the question and move on.
Eliminate obvious wrong answers (I found that this was applicable for some of the questions)
Look for specific keywords and relationship between questions

Resources

https://aws.amazon.com/certification/certified-cloud-practitioner/

Microsoft Azure and How I got myself AZ 900 Certified

Azure is arguably the most popular cloud provider out there, and with the recent US Department of Defense deal worth USD 10 billion, it has leap-frogged the competition (with Google, IBM and Oracle choosing not to compete or maybe not in the same league and AWS being the only contender). Additionally, closer home there was news about Microsoft India earnings crossing the USD 1 billion mark in the last fiscal with a big push attributed to Azure.

I had dabbled a bit in AWS and Azure before, but recently when we had a certain upskilling push at our organization I jumped head-first into it. So, a bunch of us were nominated to attend boot camps and I was chosen for the Azure track and specifically for the Azure 900 fundamentals course. It was a week-long session, with additional study sessions over the weekends. This was held in sometime in August and we had to enroll for the exam immediately after that. However, with sprints, release planning, demos, architectural reviews and with a heavy dose of procrastination, I never blocked a date for the exam. It was sometime in October, that I decided enough is enough and I had to prove to myself that I can get this certification done with 🙂

Firstly, an overview about certifications in Azure

Microsoft earlier had a popular certification called MCSE (Microsoft Certified Solutions Expert) which catered to Cloud Platform and Infrastructure. This certification basically needed a broad and deep knowledge of *almost* everything in Azure.

Azure is an huge ecosystem, and getting really good at a the whole thing was unwieldy, difficult and not popular with Developers, Administrators and Architects alike. There was a need for a revamp.

Enter, Role based certifications

It was an entire paradigm shift from focus on product knowledge to a skills-based approach. It is broadly classified for Administrators, Developers, Solution Architects, Functional Consultants and DevOps Engineers.

With these certifications, it is not about a having a badge of the knowledge you have, it is about being ready for a career in a specific role.

AZ – 900

AZ-900 is the a fundamental level certification for anyone who wants to start off with Azure. It is to prove that you are knowledgeable about Cloud and the various Azure services and their broad application.

The skills measured are –

Understand cloud concepts
Understand core Azure services
Understand security, privacy, compliance, and trust
Understand Azure pricing and support

The entire outline can be downloaded from here.

How I prepared

Take notes, create mind maps, use OneNote, scribble, whatever (but is important)
Dedicated atleast an hour each day for more than two weeks.
Prepare, prepare, prepare – Cannot emphasize this enough.
Pickup an exam slot and work backwards
- Assign atleast a couple of days for each skill measured
Hands-on is important
- Create an account and get familiar with the portal
- Provision resources
- Download the CLI and try commands (Powershell commands as well)
Mock exams are important
- To get familiar with the pattern
- Understand types of questions
- Crystallize concepts
- Confusing service names! (ATP vs AIP, Virtual Networks vs Virtual Network Gateways)
- Services seemingly doing similar work (LoadBalancers vs Application Gateways vs Firewalls vs NSGs)

Resources

Azure Learning (approx 10 hours)

Platform Quotes

Well, this is a blog for a curated list of quotes on ‘Platforms’, the software type where one can exchange value. I shall also try to quote of the source (or where a quote was originally quoted!).

Businesses want to move from a Pipe business model to a Platform Business model.

“You know you are building a platform if your users are using it in ways you have never imagined”

The goal of the platform is to enable interactions between producers and consumers – repeatedly and efficiently.

Platforms are ubiquitous.

Quotes from Geoffrey G. Parker

In recent years, more and more businesses are shifting from the pipeline structure to the platform structure. In this shift, the simple pipeline arrangement is transformed into a complex relationship in which producers, consumers, and the platform itself enter into a variable set of relationships.

Yet all are operating businesses that share the fundamental platform DNA—they all exist to create matches and facilitate interactions among producers and consumers, whatever the goods being exchanged may be.

The shift from protecting value inside the firm to creating value outside the firm means that the crucial factor is no longer ownership but opportunity, while the chief tool is no longer dictation but persuasion.

Platform competition requires treating buyers and suppliers not as separate threats to be subjugated but as value-creating partners to be wooed, celebrated, and encouraged to play multiple roles.

And while platform businesses themselves are often extraordinarily profitable, the chief locus of wealth creation is now outside rather than inside the organization.

Thus, every platform business must be designed to facilitate the exchange of information.

Yet, in most cases, platforms don’t create value units; instead, they are created by the producers who participate in the platform. Thus, platforms are “information factories” that have no control over inventory.

Facebook’s news feed is a classic multiuser feedback loop. Status updates from producers are served to consumers, whose likes and comments serve as feedback to the producers. The constant flow of value units stimulates still more activity, making the platform increasingly valuable to all participants.

Later still, LinkedIn created another interaction when it allowed thought leaders, and subsequently all users, to publish posts on LinkedIn for others to read, effectively turning the site into a publishing platform.

A platform’s overarching purpose is to consummate matches among users and facilitate the exchange of goods, services, or social currency, thereby enabling value creation for all participants.

As a result of the rise of the platform, almost all the traditional business management practices—including strategy, operations, marketing, production, research and development, and human resources—are in a state of upheaval. We are in a disequilibrium time that affects every company and individual business leader. The coming of the world of platforms is a major reason why.

Our goal is not to build a platform; it is to be across all of them.

~ Mark Zuckerberg

We want to be the platform that solution providers can use to run their businesses. ~ Bob Vogel

Platform thinking = Software design + Market design + Agility. ~Thierry Isckia

Your social platform will become the motherboard of your business

Every Business needs to become a Platform

“We are no longer in the business of building software. We are increasingly moving into the business of enabling efficient social and business interactions, mediated by software.”

Platform Musings

This post is not structured right now, it will be a mind-dump of my latest fascination – Platforms

You think #Product and say why? I think #Platform and say why not?

AI is the next electricity

If Data is the next oil, then AI is the next electricity

Bangalore

Have you ever wondered where the centre of a city is? Where do those milestones you see on a highway which tend to (->) 0 actually reach 0? How does GPS measure distances? Well, I took it upon myself to find out!

What other city than our Bangalore? namma Bengaluru?

I wanted to figure out where the centre of Bangalore / Bengaluru is! I searched for Google for the Lat/ Long Co-ordinates of this great city – this is what I found

Brilliant, so she is at a Latitude of 12.9716 and a Longitude of 77.5946.

Now, let us Plot!

I am using the map visualization package called ‘leaflet’ to plot. More details on this package at this link.

It is not a lot of code to plot using the leaflet package, a snapshot of RStudio with the code and the generated plot below –

Look at the location, are you surprised? Well, I was. I was expecting it to be at the junction of old city (Chickpet) or some place near City Market or Chamarajpet!

If someone from the future were to communicate to you (from another dimension using say Gravity 🙂 Interstellar style) and ask you to be in the centre of Bangalore, now you know where to wait! You just need to know when!

Word Cloud in R – Mythological Twist – Part II

Following the wonderful feedback I got on my previous post (WordCloud in R – Mythological twist), I thought I could do a similar text analysis on the other great Indian Epic, the Mahabharata!

This time it is bigger, a 5818 page, 14 MB pdf. The translation in question is the original translation from Kisari Mohan Ganguli which was done sometime between 1883 and 1896.

About the Mahabharata

The Mahabharata is an epic narrative of the Kurukshetra War and the fates of the Kaurava and the Pandava princes.

The Mahabharata is the longest known epic poem and has been described as “the longest poem ever written”. Its longest version consists of over 100,000 shlokas or over 200,000 individual verse lines (each shloka is a couplet), and long prose passages. About 1.8 million words in total, the Mahabharata is roughly ten times the length of the Iliad and the Odyssey combined, or about four times the length of the Ramayana.

The first section of the Mahabharata states that it was Ganesha who wrote down the text to Vyasa’s dictation. Ganesha is said to have agreed to write it only if Vyasa never paused in his recitation. Vyasa agrees on condition that Ganesha takes the time to understand what was said before writing it down.

The Epic is divided into a total of 18 Parvas or Books.

Well, if Rama was at the centre of Ramayana, who was the equivalent in Mahabharata? Krishna? One of the Pandavas? One of the Kauravas? Dhritarashtra? Or one of the queens – Draupadi? Kunti? Gandhari?

Let us find out –

Since, this is a continuation to the first blog in this series, I would not take you through the intricacies of downloading and installing packages. Also, there is a Rpdf that needs to be installed, you could lookup on the instructions in this link.

Download and copy the pdf onto a folder in the local file system. You may want to read the pdf in its entirety to a corpus.

mahabharata <- Corpus(URISource(files), readerControl = list(reader = Rpdf))

If I look at the environment variables, I can see the Corpus populated, which says it has 1 element and is of a 26.8 MB size.

You could have a look at the details using ‘Inspect’

Now, we can begin processing this text, firstly create a content transformer to remove any take a value and replace it with white-space.

> toSpace <- content_transformer(function(x, pattern) {return (gsub(pattern, " ", x))})

use this to eliminate colons and hyphens

> mahabharata <- tm_map(mahabharata, toSpace, "-")
> mahabharata <- tm_map(mahabharata, toSpace, ":")

Next, we might need to apply some transformations on the text, to know the available transformations type getTransformations() in the R Console.

we would then need to convert all the text to lower case

> mahabharata <- tm_map(mahabharata, content_transformer(tolower))

let us also remove punctuation and numbers

> mahabharata <- tm_map(mahabharata, removePunctuation)

> mahabharata <- tm_map(mahabharata, removeNumbers)

and the stopwords

> mahabharata <- tm_map(mahabharata, removeWords, stopwords("english"))

The next step would be to create a TermDocumentMatrix, a matrix that lists all the occurrences of words in the corpus. The DTM represents the documents as rows and the words as columns, if a word occurs in a particular document, the matrix entry corresponding to that row and column is 1 or it is a 0. Multiple occurrences are then added to the same count.

> dtm <- TermDocumentMatrix(mahabharata)
> m <- as.matrix(dtm)
> v <- sort(rowSums(m),decreasing=TRUE)
> d <- data.frame(word = names(v),freq=v)

Looking at the frequencies of the words, we may need to remove certain words to distill the insights from the DTM, for e.g., words like “thou”, “thy”, “thee”, “can”, “one”, “the”, “and”, “like”…

> mahabharata <- tm_map(mahabharata, removeWords, c("the", "will", "like", "can"))

upon further refinement, let us look at the top 15 frequently appearing words

Brilliant, Isn’t it after all about a great battle between sons to be a King?!

The next occurrences throw up some interesting observations.

Yudhishthira
Arjuna
Drona
Bhishma
Karna

and Krishna, who is considered central to the Epic is at 8th of most mentioned characters in Mahabharata.

Let us generate the WordCloud from this

Source for content on the Mahabharata

WordCloud in R – Mythological twist

A WordCloud in R

Let Noble thoughts come to us from every side

– Rigveda, I-89-i

Have you ever wondered what it would be to do a textual analysis of some ancient texts? Would it not be nice to ‘mine’ insights into Valmiki’s Ramayana? Or Veda Vyasa’s Mahabharata? The Ramayana arguably happened about 9300 years ago. In the Thretha yuga. The wiki for Ramayana.

The original Ramayana consists of seven sections called kandas, these have varying numbers of chapters as follows: Bala-kanda—77 chapters, Ayodhya-kanda—119 chapters, Aranya-kanda—75 chapters, Kishkindha-kanda—67 chapters, Sundara-kanda—68 chapters, Yuddha-kanda—128 chapters, and Uttara-kanda—111 chapters.

So, there are a total of 24,000 verses in total. Well, I don’t really have the pdf of the ‘Original’ version, I thought I could use C. Rajagopalachari’s English retelling of the epic. This particular book is quiet popular and has sold over a million copies. It is a page-turner and has around 300 pages.

How about analyzing the text in this book?

Wouldn’t it be EPIC?!

That is exactly what I want to embark on this blog, text mining helps to derive valuable insights into the mind of the writer. It can also be leveraged to gain in-tangible insights like sentiment, relevance, mood, relations, emotion, summarization etc.

The first part of this series would be to run a descriptive analysis on the text and generate a word cloud. Tag clouds or word clouds add simplicity and clarity, the most used words are displayed as weighted averages, the more the count of the word, bigger would be the size of the word. After all, isn’t it visually engaging than looking at a table?

Firstly, we would need to install the relevant packages in R and load them –

The second step would be to read the pdf (which is currently in my working directory)

I first validate if the pdf is there in my working directory

The ‘tm’ package just provides a readPDF function, but the pdf engine needs to be downloaded. Let us use a pdf engine called xpdf. The link for setting up the pdf engine (and updating the system path) is here.

Great, now we can get rolling.

Let us create a pdf reader called ‘Rpdf’ using the code below, this instructs the pdftotext.exe to maintain the original physical layout of the text.

>  Rpdf <- readPDF(control = list(text = "-layout"))

Now, we might need to convert the pdf to text and store it in a corpus. Basically we need to instruct the function on which resource we need to read. The second parameter is the reader that we created in the previous line.

>  ramayana <- Corpus(URISource(files), readerControl = list(reader = Rpdf))

Now, let us check what the variable ‘ramayana’ contains

If I look at the summary of the variable, it will prompt me with the following details.

The next step would be to do some transformation on the text, let us use the tm_map() function is to replace special characters from the text. We could use this to replace single quotes (‘), full stops (.) and replace them with spaces.

Also, don’t you think we need to remove all the stop words? Words like ‘will’, ‘shall’, ‘the’, ‘we’ etc. do not make much sense in a word cloud. These are called stopwords, the tm_map provides for a function to do such an operation.

> ramayana <- tm_map(ramayana, removeWords, stopwords("english"))

Let us also convert all the text to lower

> ramayana <- tm_map(ramayana, content_transformer(tolower))

I could also specify some stop-words that I would want to remove using the code:

> ramayana <- tm_map(ramayana, removeWords, c("the", "will", "like", "can", "and", "shall"))

Let us also remove white spaces and remove the punctuation.

> ramayana <- tm_map(ramayana, removePunctuation)
> ramayana <- tm_map(ramayana, stripWhitespace)

Any other pre-processing that you can think of? How about removing suffixes, removing tense in words? Is ‘kill’ different from ‘killed’? Do they not originate from the same stem ‘kill’? Or ‘big’, ‘bigger’, ‘biggest’? Can’t we just have ‘big’ with a weight of 3 instead of these three separate words? We use the stemDocument parameter for this.

> ramayana <- tm_map(ramayana, stemDocument)

The next step would be to create a term-document matrix. It is a table containing the frequency of words. We use ‘termdocumentmatrix’ provided by the text mining package to do this.

> dtm <- TermDocumentMatrix(ramayana)
> m <- as.matrix(dtm)
> v <- sort(rowSums(m),decreasing=TRUE)
> d <- data.frame(word = names(v),freq=v)

Now, let us look at a sample of the words and their frequency we got. We pick the first 20.

Not surprising, is it? ‘Rama’ is indeed the centre of the story.

Now, let us generate the word cloud

> wordcloud(words = d$word, freq = d$freq, min.freq = 3, max.words=100, random.order=FALSE, rot.per=0.60,  colors=brewer.pal(8, "Dark2"))

Voila! The word cloud of all the words of Ramayana.

A view of plot downloaded from R.

If you like this, you could comment below. If you would like to connect with me, then be sure to find me on Twitter, Facebook, LinkedIn. The links are on the side navigation. Or you could drop an email to shivdeep.envy@gmail.com

Data Manipulation in R with dplyr

The R language is widely used among data scientists, statisticians, researchers and students.

It is simply the leading tool for statistics, data analysis and machine learning. It is platform-independent, open-source, and has a large, vibrant community of users.

The Comprehensive R Archive Network is the one-stop-shop for all R packages.

This really brings us to the package to be discussed on this blog – dplyr. The CRAN documentation for dplyr can be found here.

For this blog, I would be demonstrating the 5 operations of the package. The first thing we would need is to install the package and load the library.

> install.packages(“dplR”)

> library(dplR)

We then need to find a dataset on which we could run these operations. CRAN makes the download logs of their packages publicly available here – CRAN package download logs. Let us download the file for July 8, 2014 (we could really pick a log from any date) onto RStudio’s working directory.

Once the file has been copied onto the working directory of R, execute the below line (where the variable path2csv stores the location of the csv)

> mydf <- read.csv(path2csv, stringsAsFactors = FALSE)

we then save the data frame onto a variable called cran by converting it to a tbl_df to improve readability. Calling the variable cran prints out the contents.

> cran <- tbl_df(mydf)

> cran

The dplyr philosophy is to have small functions that do one thing well. There are basically 5 commands that cover most of the fundamental data manipulation tasks.

select()

Usually in the entire data set that we use for analyis, we would really be interested in a few columns. This function is used to select / fetch the columns which are required. If I only need the columns ip_id, package and country. I execute the following statement –

> select(cran, ip_id, package, country)

It is important to note that the columns are returned in the order in which we specified, irrespective of how it was in the original dataframe.

We could also use the ‘-‘ sign to ommit the columns we do not need.

> select(cran, -time)

filter()

Now that we know how to select columns, the next logical thing would be to be able to select rows. That is where the filter() function comes in.

This is like the ‘where’ clause in SQL. Let us understand this by an example –

> filter(cran, package == "swirl")

If you look at the column ‘package’, we now see that the resulting dataframe has only rows which have the package as ‘swirl’.

Multiple conditions can be passed to filter() one after the other. For example, if I want to fetch all swirl packages downloaded on the OS – linux in India:

> filter(cran, package == "swirl", r_os == "linux-gnu", country == "IN")

arrange()

This is used to order the rows of a dataset according to the values of a particular variable. Suppose we want to order all rows of a dataset in ascending / descending order of a column. Notice the ip_id column listed in descending order.

> arrange(cran2, desc(ip_id))

mutate()

This function is used to edit or add additional columns to the dataframe. Suppose I want to convert the size column which is in bytes to megabytes and store the values in a column called size_mb.

> mutate(cran3, size_mb = size / 2^20)

sumarize()

This function is used to collapse the dataset into a single row, the go-to function to calculate the mean in a sanitized dataframe.

For example – I want to know the average download size from the size column.

> summarize(cran, avg_bytes = mean(size))

sumarize() can also be used to fetch records in groups using the FOR EACH construct.

Disclosure: The above example is from the dplyR lesson on the swirl package.