My abstract: In this talk we are going to examine Data Analytics in the Cloud. In particular, we will demonstrate the data flow for Analytics and Machine Learning, starting with sourcing data (structured, semi-structured and unstructured), pipelining it in both batch mode and streaming, storing it in a Data Lake, and finally making it available to Machine Learning and Business Intelligence. We will illustrate this complex engineering process with examples from Amazon Web Services (AWS).
Books recommended at the talk:
- Lincoln on Leadership, by Donald T. Phillips
- The Founding Fathers on Leadership, by Donald T. Phillips
- The one minute manager builds high performance teams, by Ken Blanchard
- The Serving Leader, by Ken Jennings and Stahl-Wert
- The Secret, by Ken Blanchard and Mark Miller
- Leading Change, by John P. Kotter
- The Leadership Challenge, by James M. Kouzes and Barry Z. Posner
On April 17, 2019, I will be speaking at the MIT Forum about Cybersecurity. This event is open to the public. For details:
My student Geetanjali (Geet) Agarwal defended her masters thesis titled Aneka – Wavelet Image Hashing Algorithm, see announcement, where the contribution is a framework of hashing algorithms for image recognition. This important work is done in collaboration with the SoCal High Technology Task Force (HTTF). Geet deployed the AWS to accomplish her results, including EC2 instances and MySQL databases used to run experiments on thousands of images. Geet’s thesis will be available after the final draft is ready.
On November 6, 2018, at 7:30am, I am giving a talk to the Regional Defense Partnership for the 21st Century (RDP-21) on Computer Science at CI. Here is the meeting location.
It was a pleasure to speak at the AWS/CSU Research in the Cloud series. By nature I am not a strong promoter of any technology, and the browser, OS or editor “wars” frankly bore me; I sometimes use a “lesser” technology because it happens to be more convenient, or because I don’t have the time to learn a “better” technology, or many other good reasons.
However, as a researcher and teacher I am absolutely thrilled with what AWS has to offer. I regularly give tours of our computer labs at CSU CI (to local companies, prospective graduate students, CSU trustees, fundraising prospects, etc.), and I explain that three things make it possible for a relatively small and unknown campus like ours to compete in scientific & engineering output in the national and international arena:
- How cheap embedded systems have become; a Google Raspberry Pi is $35, and it comes with Linux and GPIO that makes it into a universal controller.
- How cheap 3D printing has become, and in turn this frees us to some extent from having to build an expensive manufacturing lab.
- And AWS: Amazon Cloud Computing Services. Instead of buying, maintaining, cooling and powering expensive servers, we can immediately utilize the required services, and pay as we go. This works very well for a university because we do not have to make up-front capital investments, and our usage is not always the same (e.g., practically no classes in the summer).
Material related to the talk
- Examples of AWS related projects that my students and I have undertaken over the last year: http://prof.msoltys.com/?tag=aws.
- AWS presentation slides.
- Video of the presentation (my talk start at about 12min)