Data architecture can be difficult to understand – when new to the subject, it can be very hard to distinguish between hype and concept, between real things (with obscure names) and conceptual names which are actually marketingy gizmos. Over time in consulting I come back to a fish based explanation, which I synthesised in a phone call last year with someone and have finally committed to paper.
My firm have recently signed up with Snowflake, so on a spare afternoon I decided to compare the different ways you can connect data to Snowflake from Alteryx. I’m doing this with some Kickstarter data I found on data.world; I’m breaking the data down into four tables, and writing each table with a different method.
Webscraping can vary between wildly hard (purposefully or otherwise) and being the notch below an open API to extract data. However, there are often some hints you can use to check whether or not a particular site will be one of the easy or hard ones.
You’re not really meant to say this when you are (or have been) a data analyst/scientist/whatever, but I have a limited patience/tolerance for the reformatting and cleaning of data.
The story of The Warming Stripes needs little in the way of explicit direction, which is also why it is so adaptable to odd media (like ties and earrings).
I haven’t felt compelled to write on this blog in some time, partly out of time and partly out having … More
If you know me in real life, you probably didn’t know me for very long before realising that privacy and mass surveillance are some of my hot topics, which I could discuss for several hours without much of a break.
What is particularly interesting about this article and the debate it ignited (or soured, depending on how you look at it) is that it seems blind to the increasing interdisciplinary trend in data science – of which medicine is an obvious part – where fresh eyes find new ways to understand a problem.
Yesterday in the Hague, Radovan Karadzic was convicted for 40 years for his role in the Bosnian war. What some people may not know is that statistical analysis of migrational movements and killings was one of the ways external observers demonstrated that there was a systematic campaign of ethnic cleansing in Serbia.
Our societies use “threat levels” determined by domestic intelligence agencies for policing and security policy – but what do they mean?