• Introduction to Snowflake

    Topics covered: what Snowflake is and what it is used for, the Snowflake AI Data Cloud, Snowflake architecture, an introduction to SnowSQL, loading and managing data, creating tables and schemas, basic SQL queries, managing users and roles, configuring virtual warehouses, Snowpark, Streamlit, and tips for optimising costs and performance.

  • Big Data: ISO 8000 and Data Quality

    What are the norms, standards and frameworks that help us manage data quality in Big Data environments?

  • Goodbye 2022 Hello 2023

    A new year begins, and I review the words I used most in my documentation in 2022, a reflection of the work done throughout the year. 2022 was the year of Python courses oriented to Big Data (Hadoop, Spark) and analytics (pandas, matplotlib and NumPy). There were also courses on automation with DevOps and on the use of GCP or the Amazon cloud.

  • Big Data Fundamentals

    I am sharing a Jupyter Book built from the notes I prepared for the Big Data Fundamentals course. The syllabus includes an introduction to Big Data and data analysis, the market and its trends, the history of the field, example use cases, best practices, and Big Data processes, among other things.

  • Big Data Fundamentals, Part II


    I'm sharing Big Data Fundamentals, Part II (Part I is here), an introduction to Big Data covering: Big Data processes (ingest, store, process/query, visualize); tools and technologies (Hadoop, Sqoop, Kafka, Mesos, Redis, CouchDB); document stores (MongoDB); column stores (HBase and Cassandra); Big Data analytics (Spark, Storm); and the Elastic Stack (Logstash, Elasticsearch and Kibana).

    We'll also look at machine learning techniques with Spark (MLlib, Streaming) and TensorFlow.
