Tag: tutorial

  1. Using Apache Spark Neural Networks to Recognise Digits

    One of the famous machine learning challenges is performing handwritten character recognition (classification) on the MNIST database of handwritten digits. The MNIST dataset has a training set of 60,000 and a test set of 10,000 28x28 pixel images of handwritten digits and an integer value between 0…

    on tutorial spark sparkml scala

  2. Setting up a Standalone Apache Spark Cluster

    A few people have asked me how to set up a small Standalone Spark Cluster for testing. Here are the scripts for Ubuntu 15.10 to install Apache Spark 1.6.0, which should have you up and running very quickly. This guide assumes you have done a new installation of…

    on tutorial spark

  3. Natural Language Processing with Apache Spark ML and Amazon Reviews (Part 2)

    Continues from Part 1. 4 Execution. 4.1 The Pipeline. Now that we have all the components of the pipeline ready, all that is needed is to load them into the Spark ML Pipeline(). A pipeline helps with the sequencing of stages so that we can automate the pipeline in the…

    on tutorial spark sparkml scala

  4. Natural Language Processing with Apache Spark ML and Amazon Reviews (Part 1)

    The most exciting feature of Apache Spark is its 'generality', meaning the ability to rapidly take some text data, transform it to a graph structure and perform some network analysis with GraphX, take that dataset and apply some machine learning algorithms with SparkML, and store it in memory and query…

    on tutorial spark sparkml scala

  5. Performance Tuning Spark WikiPedia PageRank

    In my previous post I wrote some code to demonstrate how to go from the raw database extracts provided monthly by WikiPedia through to loading them into Apache Spark GraphX and running PageRank. In this post I will discuss my efforts to make that process more efficient, which may be relevant…

    on tutorial spark scala graphx

  6. Computing WikiPedia's internal PageRank with Apache Spark

    Recently I have spent a lot of time reading and learning about graphs and graph analytics, which naturally drew me to Apache Spark GraphX, having previously played with Neo4j. The benefits of GraphX are: fully open source; scalable using the Apache Spark model; written in Scala, which I have been…

    on tutorial spark scala graphx

  7. Migrating from Joomla to Ghost

    After yet another Joomla security issue and the rise of simpler programming using Node.js, I thought it was time to port my old Joomla 2.5.x blog to a new platform: Ghost. Unfortunately, I couldn't find many resources online on how to do the data migration but…

    on tutorial sql

  8. Loudspeaker Frequency Response Measurement

    This tutorial aims to show you how to perform a loudspeaker frequency response measurement using ARTA for a two-way loudspeaker system. Required Equipment: A sound card (either internal or external) which has two line-out (left and right channels) and two line-in (left and right channels). An amplifier (5W+ is more…

    on Audio technical tutorial

  9. Replace a Tweeter Voice Coil

    If you are reading this, it is likely you are kicking yourself for destroying a perfectly good tweeter. Don't worry. Fortunately, Seas expected this kind of reckless behaviour and provides replacement voice coils for its tweeters. In this tutorial I will show you how to replace the voice coil…

    on Audio technical tutorial