This course introduces students to the capabilities of BigSheets. BigSheets is a component of IBM BigInsights through the Analyst and the Data Scientist module. It provides the analyst the ability to be able to visualize and analyze data stored on the HDFS using a spreadsheet type interface without any programming.
IBM Open Source & Big Data Analytics Courses
Click Here to Request It.
DW653G – BigInsights Analytics For Programers
This course is designed to aid programmers who are working with IBM’s InfoSphere BigInsights. Students will learn how to create annotators through the use of IBM’s Annotation Query Language (AQL). Analyzing data using Apache’s Hadoop normally requires that MapReduce programs be written. Students will learn how to use Jaql to create high level programs that are decomposed into Hadoop MapReduce programs. Students will learn a foundation to program using the Apache Pig language, and how to publish a text analytics application from the BigInsights development environment to a BigInsights server.
DW654G – IBM BigInsights Text Analytics (V4)
This course will teach students how to use IBM BigInsights Text Analytics, an information extraction system, to extract information from unstructured and semi-structured documents. Using IBM BigInsights Text Analytics students can create extractors using a visual web interface. The visual extractors are then automatically translated into Annotation Query Language (AQL) rules to extract structured information from unstructured and semi-structured documents. Students can apply Text Analytics to big data at rest in IBM BigInsights and big data in motion in IBM Streams.
DW664G – IBM Big SQL for Administrators (v5.0)
This course is designed to introduce the student to some of the additional capabilities and the administration of IBM Big SQL. IBM Big SQL allows you to access your HDFS data by providing a logical view to it. You can use the same SQL that was developed for your data warehouse data on your HDFS data. This course covers Big SQL security using row and column access controls, impersonation, and data federation. The course also covers some of the best practices, performance tuning, and monitoring techniques, YARN integration and also includes an optional unit to explore a Big SQL installation.
DW724G – Programming for IBM InfoSphere Streams V4 with SPL
This course teaches students about the Streams Processing Language. It will begin with the basic concepts of InfoSphere Streams and the basic Streams Processing Language operators used in a Streams program. Students will learn how to access data from an external source using the Source type operators and write an output stream using the Sink type operators.
Students will then learn how and when to use the various Stream operators, like the Functor, Punctor, Aggregation, Sort, Join, Split, Barrier, Delay, and Switch operators. The second half of the course shows how to control the placement of processing elements and the debugging capabilities of the Streams Processing Language. Students will also learn about consistent regions and how to use them to process tuples at-least-once.
DW732G – Administration of IBM Streams
This course enables students to acquire the skills necessary to administer an IBM Streams system. This course covers creating Streams domains and instances, using ZooKeeper in a high availability environment, viewing the state of Streams domain and instance services, stopping and starting processing elements, viewing the jobs and processing elements that are running, and a variety of other topics. In addition, it covers defining resource tags, adding a resource to a Streams domain and instance, setting the access control list for security objects to give permission to users to work with those objects, and submitting and cancelling Streams jobs.