Summer Workshop – 18-20 July 2016
Great day's discussion @turinginst talking #data access & prep “Improving the #dataanalytics process” #ATIidap pic.twitter.com/NrSARvekIJ
— Alice Data (@alice_data) July 18, 2016
Overview of #ATIidap workshop aims from Ian Horrocks & Chris Williams, quick word from Andrew Blake abt @turinginst pic.twitter.com/3v6rdVhesp
— Alice Data (@alice_data) July 18, 2016
Motivation for using meaning-centred semantic technology: it can integrate heterogeneous data sources #ATIidap pic.twitter.com/4fNYUVEjxH
— Alice Data (@alice_data) July 18, 2016
How does semantic tech work? standard language 4 exchge data, vocab/schemas,ontologies&data via @turinginst #ATIIdap pic.twitter.com/jAR0F4sPOv
— Alice Data (@alice_data) July 18, 2016
Why semantic tech? Rich conceptual schemas, user centric, declarative, logic based semantics #ATIIdap pic.twitter.com/EfuzNc85EJ
— Alice Data (@alice_data) July 18, 2016
Sounds great but beware the scalability challenges #complexity theory #ATIIdap pic.twitter.com/hKGmldjXHw
— Alice Data (@alice_data) July 18, 2016
Example query re-write on RDF (Resource Description Framework) with subject, predicate, object data structure #ATIIdap pic.twitter.com/dewiv71PcP
— Alice Data (@alice_data) July 18, 2016
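The slide itself is not reproduced here, so the following is only a minimal sketch of the general idea, assuming a tiny in-memory store of (subject, predicate, object) triples and a single kind of ontology axiom (subclass). All identifiers (`ex:w1`, `ex:Wellbore`, `match`, `rewrite_type_query`, `instances_of`) are hypothetical and invented for illustration; this is not the Optique implementation. A query for a class is rewritten into a union of queries over that class and its subclasses, then evaluated against the unchanged data.

```python
# Hypothetical data: every identifier below is invented for illustration.
triples = [
    ("ex:w1", "rdf:type", "ex:ExplorationWellbore"),
    ("ex:w2", "rdf:type", "ex:DevelopmentWellbore"),
    ("ex:f1", "rdf:type", "ex:Field"),
]

# Toy ontology: maps each subclass to its direct superclass.
subclass_of = {
    "ex:ExplorationWellbore": "ex:Wellbore",
    "ex:DevelopmentWellbore": "ex:Wellbore",
}


def match(pattern, store):
    """Return the triples matching a (s, p, o) pattern; None acts as a wildcard."""
    return [
        t for t in store
        if all(p is None or p == v for p, v in zip(pattern, t))
    ]


def rewrite_type_query(cls, axioms):
    """Expand a class into itself plus all of its transitive subclasses."""
    classes = {cls}
    changed = True
    while changed:
        changed = False
        for sub, sup in axioms.items():
            if sup in classes and sub not in classes:
                classes.add(sub)
                changed = True
    return sorted(classes)


def instances_of(cls, store, axioms):
    """Answer 'which subjects have type cls?' using the rewritten query."""
    found = []
    for c in rewrite_type_query(cls, axioms):
        found.extend(s for s, _, _ in match((None, "rdf:type", c), store))
    return sorted(found)


if __name__ == "__main__":
    print(rewrite_type_query("ex:Wellbore", subclass_of))
    print(instances_of("ex:Wellbore", triples, subclass_of))
```

No triple in the store is typed `ex:Wellbore` directly; the rewritten query still finds both wellbores because the subclass axioms are applied to the query rather than to the data.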
Optique data solution: central repository of mapping,ontology&queries whilst data stays in existing stores #ATIIdap pic.twitter.com/l3vZjnDLpJ
— Alice Data (@alice_data) July 18, 2016
But like all data solutions there are challenges – ontology&mapping, lack skillset, maintenance, identity, variety & security #ATIIdap
— Alice Data (@alice_data) July 18, 2016
Overview of #dataprep work integral to data analysis process via @turinginst from Chris Williams #ATIIdap pic.twitter.com/6jPN88FDyf
— Alice Data (@alice_data) July 18, 2016
#DataAnalytics :Finding patterns in data&using those patterns to make predictions #datascience @turinginst #ATIIdap pic.twitter.com/dYquimvICz
— Alice Data (@alice_data) July 19, 2016
Great methodology for #dataanalysis = #datamining standard process #CRISPDM @turinginst #ATIIdap pic.twitter.com/6ndVAZSfdL
— Alice Data (@alice_data) July 19, 2016
“Data Dictionary” contains a list of all data files in the database, no. of records #dataprep #ATIIdap #DataAnalysis pic.twitter.com/6jwGKjJfGe
— Alice Data (@alice_data) July 19, 2016
#Dataprep is an integral part of #dataanalysis including data integrations, record linkages, handling formats, missing data and …#ATIIdap
— Alice Data (@alice_data) July 19, 2016
Who doesn’t love #datatidying ? Check out @hadleywickham article https://t.co/93tZdYrKaI #tidyverse #rstats #ATIIdap pic.twitter.com/TWa47ksAAy
— Alice Data (@alice_data) July 19, 2016
Where is #anomalydetection best placed in the #dataprocess? Useful for both #dataprep and #dataanalysis #ATIIdap pic.twitter.com/yxUM6u1TH3
— Alice Data (@alice_data) July 19, 2016
Example #dataprep stories from @turinginst #ATIIdap patients app 2detect link between jointpain&weather @OfficialUoM pic.twitter.com/FYhQa4qk26
— Alice Data (@alice_data) July 19, 2016
Second example #dataprep @turinginst @DECCgovuk energy flow data #ATIIdap pic.twitter.com/QAucIFu7RV
— Alice Data (@alice_data) July 19, 2016
Third example #dataprep Norwegian petroleum factpages using semantic data https://t.co/SDfDrjbfLA https://t.co/PBmkVwoQ0U #ATIIdap
— Alice Data (@alice_data) July 19, 2016
.@turinginst #ATIIdap discussed #spreadsheets & how they are involved in #dataprocess – check out @JennyBryan talk https://t.co/fnbrXcmXq0
— Alice Data (@alice_data) July 19, 2016
Challenges via Siemens “what makes #datascience hard for us?” heterogeneity, automation needs, pre-existing data, solutions & tools #ATIIdap
— Alice Data (@alice_data) July 19, 2016
Heterogeneity problems where semantic tech helps: “ETL problems: when mapping is improved the data is now wrong” #ATIIdap pic.twitter.com/yay5Xm8wEo
— Alice Data (@alice_data) July 19, 2016
Nice slide on scale of all things #data #ATIIdap #DataScience pic.twitter.com/a7vnCsJTBD
— Alice Data (@alice_data) July 19, 2016
Data analytics as the 4th paradigm discussed by @turinginst #ATIidap pic.twitter.com/typP8z9ajl
— Alice Data (@alice_data) July 19, 2016
Kepler space telescope – its focal plane collecting TB of images of stars aiming at finding exoplanets #ATIIdap pic.twitter.com/Vv2V92n66f
— Alice Data (@alice_data) July 19, 2016
Telescope #dataproblems : how do you tell the difference between #space noise and #exoplanets? #DataScience #ATIIdap pic.twitter.com/m1rH9lN3DK
— Alice Data (@alice_data) July 19, 2016
Finding stars is really difficult to automate from raw @NASAKepler #data but can be found #ATIIdap #DataScience pic.twitter.com/VSY9fkMnKt
— Alice Data (@alice_data) July 19, 2016
@NASAKepler detected 1000s of planets & more being found from data cleaning & #GaussianProcessing #ATIIdap #DataScience pic.twitter.com/JiLSfNIcRI
— Alice Data (@alice_data) July 19, 2016
Now onto Automating #DataScience @turinginst #ATIIdap by Zoubin Ghahramani talking about automatic model discovery pic.twitter.com/nIg2UWvDds
— Alice Data (@alice_data) July 19, 2016
“Bayes' rule tells us how to do inference about hypotheses from data” #DataScience #ATIIdap pic.twitter.com/d9g9vIOEYe
— Alice Data (@alice_data) July 19, 2016
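As a rough illustration of the quote, here is a minimal sketch of Bayes' rule over a discrete set of hypotheses: the posterior is proportional to prior times likelihood, normalised by the total evidence. The function name `posterior`, the two hypotheses and all the probabilities are assumptions made up for this example, not figures from the talk.

```python
def posterior(prior, likelihood):
    """Apply Bayes' rule: P(h | data) = P(data | h) * P(h) / P(data).

    prior: dict mapping hypothesis -> P(h)
    likelihood: dict mapping hypothesis -> P(data | h)
    """
    unnormalised = {h: prior[h] * likelihood[h] for h in prior}
    evidence = sum(unnormalised.values())  # P(data), summed over hypotheses
    return {h: value / evidence for h, value in unnormalised.items()}


if __name__ == "__main__":
    # Hypothetical numbers: is a dip in a star's brightness a planet or noise?
    prior = {"planet": 0.01, "noise": 0.99}
    likelihood = {"planet": 0.9, "noise": 0.05}  # P(observed dip | hypothesis)
    print(posterior(prior, likelihood))
```

With these invented numbers the data favour the planet hypothesis far more strongly than the noise hypothesis, yet the posterior for "planet" stays well below one half because the prior is so small.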
Automating #DataScientist – Data Analysis Tools for dummies problem, solution ingredients #DataScience #ATIIdap pic.twitter.com/hMQlzz87EX
— Alice Data (@alice_data) July 19, 2016
Machine learning – 80% data prep, 15% after ML, so only 5% is ML #ATIIdap pic.twitter.com/m9Aa2tgul6
— Alice Data (@alice_data) July 19, 2016
Data spec is never fully correct and understanding data takes time, even just reading the column headers #ATIIdap pic.twitter.com/tVVry6y1Rk
— Alice Data (@alice_data) July 19, 2016
We often clean data but don’t document each step; we could use it for #ML #ATIIdap pic.twitter.com/RwbRKcVMKk
— Alice Data (@alice_data) July 19, 2016
My fav quote from #ATIIdap “data is like a cat, it’s often purring but occasionally you get scratched” #DataScience pic.twitter.com/v3nOTS6K9q
— Alice Data (@alice_data) July 19, 2016
Tools discussed yesterday include F# @tomaspetricek exploring #referendum #data #ATIIdap pic.twitter.com/PxxlNky3Xc
— Alice Data (@alice_data) July 20, 2016
I would love to see dataDiff as a package in #rstats. Great presentation by @MSFTResearch at #ATIIdap @turinginst pic.twitter.com/fGjXjdzXkA
— Alice Data (@alice_data) July 20, 2016
Truly fascinating talk about #visualisation text alignment by Min Chen from @UniofOxford #ATIIdap #dataviz pic.twitter.com/exhsQcXo2z
— Alice Data (@alice_data) July 20, 2016
Check out ViTa – Visualisation for text alignments https://t.co/BZhvTgb7k9 #dataviz #ATIIdap pic.twitter.com/39lfC97yH5
— Alice Data (@alice_data) July 20, 2016
Great talk by Maria Liakata @warwickuni abt combining heterogeneous user-generated content to sense well-being #ATIIdap pic.twitter.com/7yueuUAfKX
— Alice Data (@alice_data) July 20, 2016
Lightning talks start with @socdm chairman Tom Khabaza at #ATIIdap @turinginst #datamining pic.twitter.com/ByhxVGIZqP
— Alice Data (@alice_data) July 20, 2016
Presented at @turinginst Both NEW #datascience initiatives! hope we can work & grow together! #DataScienceLeadership pic.twitter.com/ticzyVqNn4
— Alice Data (@alice_data) July 20, 2016