Category Archives: technology

Practical scenario for ML Transcribe on cloud

One of the well-known services in AI/ML stack (artificial intelligence & machine learning) of cloud computing is transcribing (speech to text) the audio and analyzing it.

While cleaning up my phone, I realised, that it has piled up a lot of voice recordings that I did on various conferences & seminars I attended.  Recording the speeches/discussions directly on phone, is a quick way of taking notes, which you can refer later & if you label it & make a transcript, it’s even better for indexing.
Thus, my experiments with Transcription services on the cloud started! (I wonder whether Evernote or OneNote has transcribe feature which converts speech-to-text on-the-go on mobile).

I had to trim a 45 minutes long audio file to 1 min. sample & convert it to mp3 (using VLC Player). To keep things simple there’s only 1 speaker talking in the audio file, although the job is not simple enough for machines, you can listen to sample audio (and try to text it in comment section!)

Google Cloud pleased me as it supports long range of input languages as audio source, so I could choose Indian English, but on the downside, it only supported 1 min audio file for free-demo. (for longer files, you may have to call APIs & write a code). Here’s the result  from Google:

150
the world's most interesting thing and I think you are the advantage of building for 
India and building the world have a domestic economy is big enough for you to grow 
as a starter and global market also I don't need to both the option that I want to 
talk to someone is very difficult to the reason is these words between is true that 
was open find out the world after appoint the world can give the field of India 
can eat Universal pain

But it gives different results if you select US English as source language, much better if selected model is ‘video’ (other options are phone call, search/command, default) Video model is not available for IN-EN.

We are around 250 in India.  to the point that
the world could have come in and they want to be locked up all over the top again, 
but more importantly I think is huge opportunity for startups the third and most 
interesting thing and I think you can speak about the relevant is after the DND 
Festival we spend some time at Israel is India has this unique Vantage of building
 for India and building for the world. We have a domestic economy, which is big 
enough for you to grow as a start-up and it has a and a global market. 
Also, I think we need a bill for both the opportunity that I only focus on one is 
very defensive because the reason is these words between these two that what 
have different kind navigation for the world to start learning after a point 
because India will be the world's and usually right so the because the scene of 
India can be Universal game.

If you check out AWS AI/ML/DL stack, it gives you ready to use tools such as Rekognition, Translate, Polly which can process the input without requiring any coding. What we need in our case is: AWS Transcribe which reads audio files stored in S3 and export the text in json format. The best part is you do not need to write any code for this nor there’s 1 min. restriction for audio. But it understands only US-EN and Spanish, which did not give great results.

Get it. So with that the ones they want. 
So  we have all but one party is huge over to you first. The most impressive thing. 
And i think you could spend some time. Is there? He has a unique case beginning for
the world. You have a domestic big enough to stop it and the opportunity only 
focused on one very defensive because of easiness. These words that was a different
world will start learning after a point because and you two guys with big feet,
you know what?

IBM Watson is a dark horse here, showed relatively fewer errors. But if you choose British English, the result was full of ‘yeah’, which is funny (and stupid!)

We get 150. This. For the one that. The one. The what we love to help all those 
online games at 148 is huge opportunity for stocks the most interesting thing 
and I think you would be more than enough for the effects of the sense of time 
and was there. Is media has unique blockades probability for India and before 
the war. Yeah domestic economy big enough for you to go in the stock and it 
has a. And over the hallways like the need to clear the board the all the Jews 
that only focus on one is very different too because the reason is these lawyers
 or differently from the way the stock offer appointed with nearly the word off 
and you could be like the because of being in the op he or university.

Lastly, Microsoft’s Azure cognitive Services gave me hard time. Unlike other cloud service providers mentioned above, it doesn’t have a quick demo page where I could simply upload a minute long file to test. So I had to sign-up for Azure services using credit card, email, mobile, & OTP verification, only to find that I am not eligible for their 30 day free trial. I somehow got the free trial after contacting the customer support. After much fooling around Azure Cognitive Services API documentation, I realised that API only allows 10 seconds of audio file (certainly not a tool we are looking for, this one is more suited for command/search). Apparently cris.ai, customspeech.ai does a batch transcription, which is again a lot of efforts. After few RestAPI experiments, I ditched! I may be missing something @Azure for such a simple task, (Please add a comment below if you can guide me to right path, or if you know easier way which I missed). But I think they should have made it simple ready-to-use service/demo.

Finally, I decided to transcribe it myself as none of the above cloud players are up to the mark.
Human interpretation *(who knows the context.)
I am trying verbatim here. I must confess, this audio was very tough to crack even for someone who was present in that discussion and recorded it. I can imagine more errors if this assigned to a layman.

...Twenty hundred accelerators. And we are around hundred and fifteen. so almost… 
So to the point that, the world’s gonna come in. and we gonna...  
..upper game but more importantly, I think it’s huge opportunity for startups.
The third and most interesting thing and I think you’ll speak about it, 
Amey, after the DLD festival where we spent some time in Israel, is 
India has this unique advantage of building for India and building for world. 
We have a domestic economy big enough for you to grow as a startup and it has a 
and a global market also I think we need to build for both.
The opportunity that I wanna focus is very different because the reason is these
worlds between the what is different for India different for world is start
blurring after a point because India will be the world. And mutually so bigger the 
theme for India is Universal theme.

Surely, above results has some errors as well!
Conclusion: Are we there yet? What do you think?

Clearly, you still need human intervention and cannot rely fully (in this case not at all) on these automate services.  In Translation job these machines at least aid 50-70% but for transcription we still have a lot to cover (& miles to go 🙂 ).

https baby!

That was long overdue! I’ve been meaning to add secure certificate for the domain (and all subdomains) amrute.me for quite some time. Finally, I made the site connection secure, thanks to Let’sEncrypt.

By the way you should donate to its wonderful cause of providing SSL/TLS certificate for free (even the wildcard certificates!!) LetsEncrypt is free, automated & Open certificate authority(CA).

The last straw #Facebook

Facebook feed algorithm is driving people to extremism. It’s making their beliefs even stronger, cementing it further by watching likeable content repetitively. While Facebook notoriously, has been working on some or the other secret social experiments at large scale, this is the last straw. I have been tempted to stop using Facebook since many years but had hold back the thought because of the business clients on social media.
I don’t know how Mahesh Murthy manages to take a stand and use fb the same time… something to learn?
But I think they are not prepared for the negative consequences, the negative sentiments and extremism causing on the society at large. This may look bit vague, but this is a very serious concern.
This should be the movement, to stop Facebook from manipulating peoples mind. I think a mentor would help! Do you feel the same? Are you game for this? RT below tweet, or better yet, comment below (using your facebook credentials 😐 )
 

DrupalCon Asia here I come

DrupalCon is happening in Asia for the first time. Lucky for me it’s happening in India, Mumbai near I live (at IIT Bombay to be precise). I’ve been following Drupal Events and news since the launch of D8; and thinking of attending the conference. Today is the last day to pay for the tickets and register for the event. I delayed my decision to max and paid more than double the super-early bird price. ₹ 8,000 is still considered an costly conference in India even if you bill it as business expense. I know it’ll payback. Drupal is a whole new level, compared to other obvious choices for website development (read WordPress & Joomla). I was blown away by the built-in features & the possibilities with Drupal 8. D8 took forever to launch since the last version (released in 2011) and it changed everything. I hope the wait is well worth, especially now, when I’ve decided to work on the project using Drupal platform. Currently I’m building it on D7 due to lack of mature modules. But, eventually more mods will be available to D8. Sooner the better. Better even before the launch of our app. My plan is to start working on Drupal 8 as soon as I am satisfied with MVP (minimum viable product/ prototype) on D7, and built it on HHVM instead of currently available version of PHP.  HHVM, the hacked PHP by Facebook team, is surely faster than PHP.
DrupalCon is multi-track (parallel lectures) event with loads of informative sessions and non-stop flow of the tips, lessons & knowledge. They have very unique session (or should I call it un-session?) called BoF.

Birds of a Feather sessions (or BOFs) are informal gatherings of like-minded individuals who wish to discuss a certain topic without a pre-planned agenda. BOFs allow groups to meet and discuss issues relating to regular conference sessions and talk out common problems facing the community.

Isn’t it cool? A space for like minded people with informal settings and no-agenda. I’m really excited to attend this conference. I never attended DrupalCamps (the smaller version) nor DrupalCons before. And I’m sure there’s going to be loads of take-away. I’ll write post-event blog about my experience.
If you happened to be coming to DrupalCon Asia next week feel free to ping me on twitter.

Look papa, New theme!

Finally made a move to fully responsive theme (not bootstrap through). In the process of doing so, I replaced my old theme files (404.php); so instead of copy new files in new theme folder, I copied them to old theme directory. I did that on my webserver (Ubuntu Server) on terminal using SSH. And unfortunately there’s no undo in CLI. 🙁 I’m glad I’ve started using Git now.

Microsoft invites to Azure Vidyapeeth

Today, in my inbox, I got an invitation to the webinar series called “Azure Vidyapeeth”
The webinar series comprises of 15 session starting from today till the end of the month April 2015. …Nothing fancy here, just introduction and demo of Azure product features. But what caught my eyes is the name they given to the series, Azure Vidyapeeth.
If you’re interested in learning this Microsoft Cloud computing/hosting technology? Go ahead and register there for free, and don’t forget to check the Botpress site as well!
PS: There’s mobile app exclusively created for Azure Vidyapeeth for WindowsPhone. I hope the same app is available for other mobile platforms.

Why Online Office365 cannot insert OneDrive files directly?

I use Ubuntu OS and simplest way to use Microsoft Office suite on linux is Online Office 365 -accessing via Onedrive.com or Office.com. Having most of the important files on OneDrive, I’ve no problem switching from device to device (or between different Operating systems). Although LibreOffice (which comes out of the box with Ubuntu Desktop) is pretty solid and has more features than MS OfficeOnline, I rather prefer online office365 as it automatically saves files on OneDrive.
But Online Office lacks a major feature. While working on document online (on a web-browser), you cannot insert files/pictures into document from your cloud storage (ie. OneDrive). I was shocked to find that its missing! This feature is so obvious that Google Docs has this since years. You can insert images from your Google Drive to spreadsheet/presentation/doc -without having to upload anything from your local machine.
C’mon, Microsoft! How could you miss this? Now I have to download some of images stored on OneDrive to my computer just to insert and upload them on my Word/PowerPoint document!

In love with Bootstrap

Recently, I realised that my unchanged theme (since 2011!) is obsolete. Although this blog is mobile friendly (thanks to jetpack, we’ve mobile theme), I’ve been thinking of single responsive theme that would work across all devices. While researching the solution, I discovered many ‘cool’ themes &  I must confess, I’m in love with Bootstrap. It’s such a fun playing around.
I might implement Bootstrap CSS on this blog some time soon…

classic theme

I checked my blog (front side of the blog) after so many days. The fonts and site design looks old-fashioned, kind-of ‘classic’. Yes, then default theme TwentyTen for the WordPress has officially become classic. I’ll re-design my websites soon.

For #LTSP server, Edubuntu is better choice. Thanks @RigvedRakshit for the tip!

Microsoft beats the competition by 1TB offer on OneDrive

OneDrive (formerly known as SkyDrive), the cloud storage from Microsoft now offers whooping 1TB of data storage to its premium subscribers. Earlier 1TB space was only offered to Office 365 for business and SkyDrive Pro (now OneDrive for business) subscribers. Now the offer is extended to Office 365 Home & Personal subscribers as well. This change makes it 1TB available for every paid subscriber of Office 365 irrespective of its plan and size of the organisation. (something Google i/o 2014 announcement for unlimited Drive storage doesnt offer for lower plans -you’ve to pay $5/month extra to avail that offer, also, small organisation are not eligible for unlimited space on G-Drive.)
onedrive-1tbI’ve been a loyal member of OneDrive since the beginning when they used to offer 25GB of space for free users. From time to time, I’ve been given extra space on OneDrive in the name of ‘loyalty bonus’ for using Microsoft’s products and services (including WindowsPhone 8, OneDrive on mobile for camera roll backup, Office 365 etc.). And this tiny bonus pleases me, makes me write blog-posts on it and promote their services for free. But this new offer beats it all. This is no more a ‘tiny addition’, this is game changer. You get 1 TB straight, (1,024 GB) makes my enthusiast bonus (20GB), loyalty bonus (10GB) & camera roll bonus (3GB)so small and negligible! Sigh!
OneDrive has increased the free quota from 7GB to 15 GB. So when you sign-up at OneDrive you get 15 GB of storage space on cloud to start. (Same as Google Drive is offering to all Google Account holders for free.)