Plan du cours
Introduction
Principes de l'informatique distribuée
- Apache Spark Hadoop
Principes de Data Serialization
- Comment les objets de données sont transmis sur le réseau Sérialisation des objets Approches de sérialisation Tampons de protocole Thrift Apache Avro caractéristiques de la taille, de la vitesse et du format de la structure des données intégration du stockage de données persistant avec des langages dynamiques schémas de typage dynamique gestion des modifications de données non balisées
Data Serialization et informatique distribuée
- Avro en tant que sous-projet de sérialisation Hadoop Java Sérialisation Hadoop Sérialisation Avro
Utiliser Avro avec
- Hive (AvroSerDe) Cochon (AvroStorage)
Portage des frameworks RPC existants
Sommaire et conclusion
Pré requis
- Une familiarité générale avec l'informatique distribuée.
Nos Clients témoignent (6)
Trainer's preparation & organization, and quality of materials provided on github.
Mateusz Rek - MicroStrategy Poland Sp. z o.o.
Formation - Impala for Business Intelligence
I thought he did a great job of tailoring the experience to the audience. This class is mostly designed to cover data analysis with HIVE, but me and my co-worker are doing HIVE administration with no real data analytics responsibilities.
ian reif - Franchise Tax Board
Formation - Data Analysis with Hive/HiveQL
Many hands-on sessions.
Jacek Pieczątka
Formation - Administrator Training for Apache Hadoop
The VM I liked very much The Teacher was very knowledgeable regarding the topic as well as other topics, he was very nice and friendly I liked the facility in Dubai.
Safar Alqahtani - Elm Information Security
Formation - Big Data Analytics in Health
The fact that all the data and software was ready to use on an already prepared VM, provided by the trainer in external disks.
vyzVoice
Formation - Hadoop for Developers and Administrators
practical things of doing, also theory was served good by Ajay