Big Data Bang Goes the Theory


Big Data

Similar Content

Browse content similar to Big Data. Check below for episodes and series from the same categories and more!

Transcript


LineFromTo

Like it or not, our world is driven by computers.

0:00:020:00:05

The details of your life, my life, and the world we live in

0:00:050:00:09

are being recorded and kept in vast stores as digital information.

0:00:090:00:13

And now, a new generation of technology

0:00:130:00:16

is analysing our data in ways that are already changing our lives.

0:00:160:00:21

We now live in a world of big data. Computers talking to each other,

0:00:210:00:26

sharing our information in ways we never believed possible,

0:00:260:00:30

sending out a stream of 1s and 0s.

0:00:300:00:32

So tonight, on Bang,

0:00:320:00:34

we find out exactly what Big Data is and what it's really

0:00:340:00:38

capable of doing, the good and the not so good of this brave new world.

0:00:380:00:43

Maggie will be looking at the frightening things people can do

0:00:430:00:46

with personal data.

0:00:460:00:47

And how some of us might be leaving ourselves vulnerable to crime online.

0:00:470:00:51

This is about someone else being really careless with your data.

0:00:510:00:54

Kind of a shocking thing to see.

0:00:540:00:57

I will be looking at how big data technology

0:00:570:00:59

can improve our lives...

0:00:590:01:01

From getting you to your holiday destination safely...

0:01:010:01:04

..to helping us to save lives.

0:01:050:01:07

This is really going to make a huge difference, isn't it?

0:01:070:01:11

And Jem will take us back to basics of what data actually is

0:01:110:01:14

and how it's become so powerful.

0:01:140:01:17

That's Bang, on data - and the new digital revolution.

0:01:170:01:21

For most of us, data means the digital information

0:01:260:01:28

that we personally use every day.

0:01:280:01:30

But there is another kind of data that we rely on without even

0:01:300:01:33

thinking about it.

0:01:330:01:35

For example, every time we take to the skies.

0:01:350:01:38

We're used to seeing streams of vapour left in the wake of planes,

0:01:400:01:43

but as we have been hearing in the news, they can also leave a data trail -

0:01:430:01:47

a stream of data that monitors the plane's performance.

0:01:470:01:52

At the headquarters of Rolls-Royce in Derby, engineers make nearly

0:01:520:01:56

half the world's passenger jet engines, including this,

0:01:560:02:00

the Trent 1000 - the engine that powers

0:02:000:02:03

many of our transatlantic flights.

0:02:030:02:06

The temperatures in the back of the engine are staggeringly high -

0:02:060:02:09

we talk about the temperature

0:02:090:02:11

as being half of the temperature of the surface of the sun,

0:02:110:02:14

and in fact it's 200 degrees above the melting point

0:02:140:02:17

of the metals that we use.

0:02:170:02:19

The only reason they don't melt is that we pass cooling air

0:02:190:02:22

through special passages

0:02:220:02:24

and channels that keeps the gas away from touching the metal.

0:02:240:02:28

The engine is full of vital components -

0:02:280:02:31

all engineered with absolute precision,

0:02:310:02:33

including an on-board computer.

0:02:330:02:36

This relatively unassuming box is the brains of the engine.

0:02:380:02:42

Not only does it control it,

0:02:420:02:44

but it also performs another crucial function.

0:02:440:02:47

It receives data from sensors buried deep within the engine,

0:02:470:02:51

measuring 40 parameters 40 times a second including temperatures,

0:02:510:02:55

pressures and turbine speeds.

0:02:550:02:58

All of the measurements received by the computer are stored,

0:02:580:03:02

and then streamed via satellite back to base, here in Derby.

0:03:020:03:05

And that's not just true for the Trent 1000.

0:03:060:03:09

It's the same for the entire fleet - that's thousands of engines.

0:03:090:03:13

A Rolls-Royce-powered engine takes off or lands every two

0:03:130:03:17

and a half seconds, somewhere in the world.

0:03:170:03:19

That's a very cool factoid.

0:03:190:03:21

And wherever they are up in the air, there's information coming back

0:03:210:03:24

about the functionality of this engine back to Derby, to here.

0:03:240:03:29

Absolutely, and they're constantly monitored

0:03:290:03:31

using clever data analytics that are looking for anything

0:03:310:03:34

going wrong in the engine, or any sign that it might need to be

0:03:340:03:37

serviced early or something like that.

0:03:370:03:39

So wherever you're flying to,

0:03:390:03:41

while you're 30,000 feet or higher,

0:03:410:03:44

thousands of streams of data are constantly sent back to base.

0:03:440:03:48

Here, computers are programmed to sift through it

0:03:490:03:53

for any anomalies.

0:03:530:03:54

So you've got 11 engines that have flagged something up on your system.

0:03:540:03:57

That's not necessarily an emergency

0:03:570:04:00

but something that needs to be looked at. Is that how it works?

0:04:000:04:02

Just an example - this has been flagged due to that -

0:04:020:04:05

a step change in the behaviour of the oil pressure parameter

0:04:050:04:08

of around ten PSI.

0:04:080:04:10

So when we start to see something that is not what

0:04:100:04:13

we would expect to see, that's the first trigger point.

0:04:130:04:16

-And that's when you guys step in.

-Absolutely.

0:04:160:04:19

Analysts here take a closer look at any problem data,

0:04:200:04:24

and get on the phone to the airlines immediately. The result?

0:04:240:04:28

Technical faults are dealt with before they become a major

0:04:280:04:31

problem, preventing delays.

0:04:310:04:34

Plus, the working life of the engine is dramatically improved.

0:04:340:04:38

One of these engines will fly around the world 450 times before it

0:04:380:04:43

needs to be overhauled - that's a hell of a mileage.

0:04:430:04:46

And just as big data works for plane engines,

0:04:460:04:49

it can also work for healthcare and the human body.

0:04:490:04:53

We've all seen computerised systems in our hospitals.

0:04:530:04:56

In intensive care, vital signs

0:04:560:04:58

need to be monitored frequently at the bedside.

0:04:580:05:02

Now traditionally, this information is noted down on paper,

0:05:020:05:05

but here at King's College Hospital,

0:05:050:05:07

a new technique is being trialled

0:05:070:05:10

that records all this information and important new data

0:05:100:05:13

in a way that could mean the difference

0:05:130:05:15

between life and death for a huge number of people.

0:05:150:05:19

Brain injuries are the most common cause of death

0:05:190:05:21

and disability in young people.

0:05:210:05:23

In UK hospitals, we treat over

0:05:230:05:26

220,000 patients with them every year.

0:05:260:05:31

Jordan Ball was admitted to King's after a motorbike accident last year.

0:05:310:05:35

His recovery is as rare as it is remarkable.

0:05:350:05:38

When I woke up, my mum and my brother came to see me.

0:05:380:05:41

Only one eye was open, and I was just staring at my brother.

0:05:410:05:45

-And a tear came down my eye.

-Don't. You will make me cry.

0:05:450:05:49

And he was like, "Mum, he knows it's me! He knows it's me!"

0:05:490:05:53

90% of people with his injury don't even wake up.

0:05:530:05:56

The 10% of people that do wake up, they need help with

0:05:560:05:59

eating, walking, they don't tend to recover like Jordan has.

0:05:590:06:03

He is on the top end of recovery.

0:06:030:06:05

-Do you realise how lucky you are, Jordan?

-I do.

0:06:050:06:08

-He does do, don't you?

-It's incredible.

0:06:080:06:11

Most patients are a lot less lucky - and for them,

0:06:120:06:16

some of the most serious problems can occur in the following hours

0:06:160:06:19

and days after admission to hospital.

0:06:190:06:22

Secondary brain injuries are a serious concern in patients

0:06:220:06:27

that suffer trauma to the brain.

0:06:270:06:29

If you imagine that this is the initial point of injury...

0:06:290:06:36

one of the biggest problems

0:06:360:06:39

is that the electrical activity in the tissue surrounding this point

0:06:390:06:43

short-circuits, and storms through all the cells,

0:06:430:06:47

using up all the energy supply, the glucose.

0:06:470:06:51

Eventually, the cells stop working and they die, never to be replaced.

0:06:510:06:56

If it was possible to know when these secondary events were starting

0:06:560:06:59

to happen, doctors could intervene and potentially limit the damage.

0:06:590:07:03

When Jordan was in ICU, he suffered seizures

0:07:050:07:09

that the doctors understood very little about.

0:07:090:07:11

So obviously, the doctors were doing everything they possibly could

0:07:110:07:13

but there was a big part of all of this that was an unknown.

0:07:130:07:17

If there was anything that could detect brain activity

0:07:170:07:20

and that doctors were then able to make a move to stop that happening,

0:07:200:07:25

that would have been wonderful.

0:07:250:07:27

King's College Hospital have been working with

0:07:270:07:29

Professor Martyn Boutelle

0:07:290:07:31

and his team at Imperial College on a big data early warning system.

0:07:310:07:36

This bolt can be fitted to the skull by a neurosurgeon without

0:07:360:07:39

even having to go into theatre,

0:07:390:07:41

turning what's going on inside the brain into data.

0:07:410:07:45

You can see, sticking out, we have a number of different sensors.

0:07:450:07:48

One of them the measures brain electrical activity.

0:07:480:07:51

Another one measures the pressure and tissue oxygen

0:07:510:07:55

and brain temperature as well.

0:07:550:07:58

And then the last one measures chemically

0:07:580:08:00

what is going on in the brain tissue.

0:08:000:08:02

So if all of the indications from a probe like this

0:08:020:08:05

-are letting you know that there is trouble ahead...

-Yes.

0:08:050:08:09

That gives you a chance to act before the secondary brain

0:08:090:08:12

injury really takes effect?

0:08:120:08:14

Exactly. That is the idea.

0:08:140:08:15

This data could produce vital new insights,

0:08:160:08:19

but it's recording between 16 and 32 channels,

0:08:190:08:23

each being measured up to 200 times a second.

0:08:230:08:26

Very quickly you can't see what's going on

0:08:270:08:29

when there's that much data.

0:08:290:08:31

Doctors in ICU need an automated solution to turn all this

0:08:310:08:35

available data into something immediately useful.

0:08:350:08:39

So Professor Boutelle turned to Cybula - big data specialists

0:08:390:08:42

who also worked on the engine monitoring systems at Rolls-Royce.

0:08:420:08:46

So, obviously,

0:08:460:08:48

an engine is completely different to a person's heart or a person's brain,

0:08:480:08:52

-but you're using the same programme to pinpoint problems in each.

-Yes.

0:08:520:08:59

-How does that work?

-Exactly.

0:08:590:09:01

Because it's doesn't matter to us

0:09:010:09:03

whether it's brain data or whether it's an aero engine.

0:09:030:09:06

It is really just the data that's the issue.

0:09:060:09:09

We are able to look at the patterns

0:09:090:09:11

in the data that characterise those events

0:09:110:09:14

through those shapes.

0:09:140:09:16

Here is an example of a brain event that we are actually looking for.

0:09:160:09:19

In this section here, we are looking

0:09:190:09:22

for this kind of spreading kind of wave.

0:09:220:09:25

The liquid goes into here...

0:09:250:09:27

So, with a big data solution at its heart, this prototype brain

0:09:270:09:31

monitoring system works in near-real-time.

0:09:310:09:34

Importantly for the busy critical care staff,

0:09:340:09:38

they could see that something was happening here so they can see,

0:09:380:09:42

"Yes, it has started to happen. We need to do something."

0:09:420:09:46

Big data is being used in flood alert, transport, natural disaster

0:09:480:09:52

response systems...

0:09:520:09:54

You name it, it can provide vital new information.

0:09:540:09:57

And always as a collaboration between research groups,

0:09:570:10:00

engineers and experts in data collection and analysis.

0:10:000:10:04

The big data revolution isn't just about storing more

0:10:050:10:08

and more unconnected information.

0:10:080:10:11

It's also about programmers

0:10:110:10:13

designing new software to spot patterns and make connections in the

0:10:130:10:16

data - and for that they need access to the data in the first place.

0:10:160:10:21

Many believe that if we could make more data

0:10:210:10:24

freely and openly available,

0:10:240:10:25

then we could crack problems in ways that were previously unimaginable.

0:10:250:10:31

The Open Data Institute is encouraging businesses

0:10:310:10:34

and the UK government to share more of their information.

0:10:340:10:38

If you make this data available,

0:10:380:10:40

it could be used to show more transparently what's

0:10:400:10:42

going on, it can make people accountable

0:10:420:10:45

for the performance of, for example, public services, health education...

0:10:450:10:48

So give us a few examples of the difference that can be made.

0:10:480:10:52

The data that was held by the Department of Transport

0:10:520:10:55

on bicycle accidents - within three days, that data had been taken,

0:10:550:10:59

turned from one form into another and somebody had written

0:10:590:11:03

an application that basically avoided

0:11:030:11:05

the bicycle accident black spots around London.

0:11:050:11:07

Just a very obvious thing that never occurred to the people

0:11:070:11:11

that had the data in the first place to do.

0:11:110:11:14

Sharing data clearly has some advantages but when it comes

0:11:140:11:18

to personal information, it can be very controversial.

0:11:180:11:21

Unless they opt out, people in England will soon

0:11:210:11:24

have their medical records put into a digital database,

0:11:240:11:27

and the NHS there plan to make them available for research.

0:11:270:11:30

It could lead to medical breakthroughs - but some

0:11:300:11:33

people still worry about releasing sensitive information like this.

0:11:330:11:37

If I'm undergoing a medical crisis,

0:11:370:11:39

I'd really like that my medical records could be shared

0:11:390:11:42

between the appropriate services in an appropriate way,

0:11:420:11:45

but blanket data publication, you have to be very

0:11:450:11:48

cautious about...in this area when it's relating to individual data.

0:11:480:11:52

And I think that both corporations

0:11:520:11:55

and governments have to be extremely careful to respect

0:11:550:11:59

and maintain the privacy of an individual.

0:11:590:12:02

The NHS say that details that could identify individuals will be

0:12:020:12:05

removed before the information is made available.

0:12:050:12:09

But medical records aren't the only sensitive data

0:12:090:12:11

we have to be conscious of.

0:12:110:12:13

Today, it seems everyone wants to know what you're doing.

0:12:130:12:16

Take a train or a bus and your journey is tracked.

0:12:160:12:20

Use a points card when you're out shopping,

0:12:200:12:22

and they keep track of what you're spending

0:12:220:12:25

and use that information to market yet more stuff to you.

0:12:250:12:28

And your bank account

0:12:280:12:30

and spending patterns are monitored by financial services.

0:12:300:12:33

Now, that's for your protection to prevent fraud,

0:12:330:12:36

but it's also for future credit checks.

0:12:360:12:39

It doesn't stop even when we think we're having some time to ourselves.

0:12:390:12:43

We now spend nearly half our waking hours in front of some screen

0:12:430:12:46

or other - but it's not always just between you and the computer.

0:12:460:12:50

Take e-mail, for example - now,

0:12:510:12:53

if you use a free service like Yahoo or Gmail, you'll be very

0:12:530:12:57

familiar with those targeted ads that appear on the page.

0:12:570:13:01

So have a look at this - this account belongs to

0:13:010:13:03

a member of our production team,

0:13:030:13:04

and you'll see that the ad that's appeared

0:13:040:13:07

is offering flights to Australia.

0:13:070:13:10

No surprise, because he uses this e-mail address

0:13:100:13:13

to book most of his travel.

0:13:130:13:14

Some free services automatically scan your search queries,

0:13:140:13:18

social networks and even e-mails to get a sense of who you are,

0:13:180:13:22

so they can target their adverts better.

0:13:220:13:25

Sometimes,

0:13:250:13:26

when a service is being offered for free, WE are what's being sold.

0:13:260:13:29

In this case, as a potential customer for a targeted ad.

0:13:290:13:33

This is what we sign up to in return for free communication,

0:13:340:13:37

map services and a world of knowledge at our fingertips.

0:13:370:13:41

Internet giants like Google, Facebook, Yahoo

0:13:410:13:43

and Twitter don't release our personal details directly to

0:13:430:13:47

advertisers, but they do generate an income from our profile.

0:13:470:13:51

So how bothered are we, really, about sharing this sort of data?

0:13:510:13:55

Let's find out from some volunteers at City University London.

0:13:550:14:00

I've got some cards for them to choose from - where red means

0:14:000:14:04

they share a lot online, green means they're sharing very little,

0:14:040:14:08

and yellow is somewhere in the middle.

0:14:080:14:11

OK, so let's start by asking you to choose a card.

0:14:110:14:14

-I'll go with one of these, actually.

-OK.

0:14:140:14:16

Yellow, but I'd like to be green.

0:14:160:14:18

I'd like to think that I protect myself.

0:14:180:14:20

I do tend to try and go through privacy settings

0:14:200:14:23

on things like Twitter and Facebook.

0:14:230:14:24

I never save card details on any of my accounts.

0:14:240:14:27

I'm a bit paranoid so...!

0:14:270:14:29

Later on we'll find out whether they share

0:14:300:14:32

as little information as they think.

0:14:320:14:34

But first - internet security expert Professor Alan Woodward

0:14:350:14:39

is showing me a murky corner of the internet

0:14:390:14:42

where personal details are bought and sold,

0:14:420:14:44

known as the Dark Web.

0:14:440:14:47

These are bulletin boards where people

0:14:470:14:49

are discussing selling now,

0:14:490:14:51

not just credit card details,

0:14:510:14:53

but all sorts of different personal information.

0:14:530:14:55

There is actually quite a black humour side to this -

0:14:550:14:58

he's saying how professional he is -

0:14:580:14:59

"This is the result of three years' hard work".

0:14:590:15:04

You can get very specific.

0:15:040:15:05

There you are, date of birth, 15.

0:15:050:15:08

Why is that important? Because in the UK, for example,

0:15:080:15:10

a name and a date of birth, to a credit agency,

0:15:100:15:13

can be considered a unique combination.

0:15:130:15:16

So how much information does someone need to glean

0:15:160:15:19

for it to be really useful?

0:15:190:15:21

Anything you can add that starts to make your reference more unique.

0:15:210:15:25

So you don't need much of that.

0:15:250:15:27

Including, for example, put your home address down,

0:15:270:15:29

your date of birth,

0:15:290:15:31

if you can get hold of something like social security numbers,

0:15:310:15:34

National Insurance numbers, in the UK.

0:15:340:15:36

I could be whoever I want, from this list of people I can buy.

0:15:360:15:40

And to give you an idea of how easy it is

0:15:400:15:42

to collect data once it's out there,

0:15:420:15:44

Alan's asked James Lyne from Cyber Security giant Sophos to join us.

0:15:440:15:49

He's using some legal and freely available data harvesting tools

0:15:490:15:53

to gather information about our volunteers.

0:15:530:15:57

What these tools all really do,

0:15:570:15:59

is, they take individual pieces of information,

0:15:590:16:02

that in themselves would be completely innocuous,

0:16:020:16:04

so a name, a social media profile,

0:16:040:16:08

an e-mail address,

0:16:080:16:09

and they combine them together using these Big Data techniques,

0:16:090:16:13

expanding the information massively

0:16:130:16:15

and make a very accurate profile of what that person looks like.

0:16:150:16:19

Surprisingly, James doesn't need much information

0:16:190:16:21

to build an accurate profile.

0:16:210:16:23

People don't realise that often photos, tweets

0:16:230:16:26

and other data they may upload, contain GPS coordinates, by default.

0:16:260:16:31

So you might not give away your address or postcode,

0:16:310:16:34

but you're giving away your location to plus or minus 10-15 metres.

0:16:340:16:38

You might see 160 tweets that correlate to that location.

0:16:390:16:44

Plus, the tweet content may talk about being at home,

0:16:440:16:46

doing something for the kids.

0:16:460:16:48

It gives away very clearly that's where they live.

0:16:480:16:51

With potentially two to three years' backlog of data,

0:16:510:16:54

that's enough to build a profile of anyone.

0:16:540:16:56

And we'll be letting the volunteers know

0:16:560:16:58

what we've found out about them, later on.

0:16:580:17:01

From keeping planes in the air to stealing identities,

0:17:010:17:05

if you've got access to data

0:17:050:17:06

you can build some incredibly powerful tools.

0:17:060:17:10

We first found this out long before the Big Data revolution,

0:17:100:17:13

over 70 years ago.

0:17:130:17:15

Jem takes up the story.

0:17:150:17:17

During World War II,

0:17:170:17:18

brilliant minds gathered in these buildings at Bletchley Park

0:17:180:17:22

to decipher encrypted German messages.

0:17:220:17:25

The results helped shorten the war by two years,

0:17:250:17:27

and saved countless lives.

0:17:270:17:30

At first they used human computers, real people sat at desks

0:17:300:17:34

cracking the codes by pen and paper.

0:17:340:17:37

But by 1943, the engineers had realised machines might be able

0:17:370:17:41

to do a much better job -

0:17:410:17:43

machines that processed with simple on/off switches.

0:17:430:17:47

So how do you link simple switches to answer a problem?

0:17:470:17:51

Well, I've got two of them here.

0:17:510:17:53

Essentially, it's a tiny computer -

0:17:530:17:55

I now need to programme it.

0:17:550:17:57

Now the input to this I'm assigning to "Is it Monday?"

0:17:570:18:03

so if it is Monday - it gets a positive input.

0:18:030:18:06

If it isn't - it gets nothing at all.

0:18:060:18:10

This switch, the input for that, "Is it 7.30?"

0:18:100:18:15

And the output here, I'm assigning "Good time to watch BBC1?"

0:18:150:18:21

Right. Let's start using the computer. Is it Monday?

0:18:210:18:26

Yes. Is it 7.30?

0:18:260:18:29

Yes.

0:18:290:18:30

The computer says it is an ideal time

0:18:320:18:35

for checking out some science on your telly.

0:18:350:18:37

At Bletchley Park,

0:18:380:18:39

engineers hooked up their own network of on/off switches

0:18:390:18:43

to crack the German codes.

0:18:430:18:45

Where I've used two switches, this machine used over 2,000,

0:18:450:18:49

and it was aptly called Colossus.

0:18:490:18:52

Back in the day, Colossus was revolutionary

0:18:520:18:55

because it used these electronic valves

0:18:550:18:58

for its fast and reliable switching.

0:18:580:19:01

Fast and reliable for its time, because within ten years,

0:19:010:19:04

that same job was being done by transistors, considerably smaller.

0:19:040:19:09

Now, I pulled this out of a modern computer.

0:19:090:19:13

The central processing unit - the chip that does the switching.

0:19:130:19:16

And on there, there are 54 million transistors.

0:19:160:19:21

And it's that kind of miniaturisation

0:19:210:19:23

that has revolutionised what we can do with computers.

0:19:230:19:28

Using switches to process ons and offs is how all computers work,

0:19:280:19:33

but today they're known as 1s and 0s.

0:19:330:19:36

You might not think you can get much subtlety

0:19:370:19:39

out of a switch just being on or off,

0:19:390:19:42

but there millions of them at work for you right now,

0:19:420:19:46

sending out a stream of 1s and 0s,

0:19:460:19:49

sequentially telling every pixel on your screen

0:19:490:19:53

just how bright or dark they need to be.

0:19:530:19:56

Your holiday snaps? A sequence of 1s and 0s.

0:19:560:19:59

Your MP3s? A load of 1s and 0s.

0:19:590:20:02

And every letter on a keyboard

0:20:050:20:06

is an eight digit code of 1s or 0s, to a computer.

0:20:060:20:11

For Colossus, data was fed in on paper tape.

0:20:110:20:15

Each punched hole, or unpunched space, acted as a 1 or a 0.

0:20:150:20:20

Today, individual 1s or 0s are called bits.

0:20:200:20:25

Nowadays we reckon eight bits are a byte.

0:20:250:20:28

And to match the storage capacity of something like a hard-drive -

0:20:280:20:33

250GB - your piece of paper would need to go to the moon

0:20:330:20:39

and back, and probably back to the moon again.

0:20:390:20:41

So how can we pack so many bits into such a little box?

0:20:430:20:46

Well, most hard-drives work using magnets.

0:20:460:20:50

Computers magnetise an area of a disc

0:20:500:20:53

like I'm magnetising these bolt-heads.

0:20:530:20:55

I'll use magnetic North for 1 and South for 0,

0:20:580:21:01

which can then be detected later.

0:21:010:21:04

In a real hard drive the magnetisable areas

0:21:040:21:07

are sitting on a spinning disc.

0:21:070:21:09

Quite literally, on there, there are millions and billions

0:21:090:21:13

of magnetisable areas,

0:21:130:21:15

each of them so small, that they're smaller than a virus.

0:21:150:21:19

10,000 of them would fit across the width of a human hair.

0:21:190:21:22

And this is spinning around at 100 times a second or more.

0:21:220:21:27

And yet still the computer is extracting

0:21:270:21:30

a phenomenal amount of data incredibly quickly.

0:21:300:21:33

Now just because this all seems like

0:21:330:21:35

some ridiculous fantasy piece of engineering

0:21:350:21:39

doesn't mean you shouldn't have a go at building your own.

0:21:390:21:43

Now where's that MDF?

0:21:430:21:44

As a team, we are putting together a massive four byte hard-drive.

0:21:470:21:52

Four rings of eight magnets on a spinning platter.

0:21:520:21:55

As the disc spins past the electro magnet

0:21:550:21:58

it reads each bit as a 0 or a 1.

0:21:580:22:01

I've left Chris and Jim to secretly encode each ring

0:22:030:22:06

as a sequence of eight bits - enough for a letter on a keyboard.

0:22:060:22:10

That's a zero.

0:22:120:22:14

And I decipher the code back into letters.

0:22:140:22:17

What takes me 30 seconds,

0:22:170:22:19

a computer does at nearly the speed of light.

0:22:190:22:22

Oh! I mean, that's just brilliant! What can I say?

0:22:220:22:26

Milk and two sugars, please.

0:22:260:22:29

Our ability to store vast quantities of information digitally,

0:22:310:22:35

a bit like this, and process it with tiny, lightning fast switches,

0:22:350:22:39

is what's driven computing,

0:22:390:22:41

and opened up this whole field of Big Data.

0:22:410:22:45

And as engineers develop even better storage,

0:22:450:22:47

and even faster processing,

0:22:470:22:49

Big Data applications are going to have a bigger and bigger influence

0:22:490:22:53

on our everyday lives.

0:22:530:22:55

Back at City University, London, it's results time

0:22:560:22:59

in our personal data experiment.

0:22:590:23:02

First up, those who chose green,

0:23:020:23:04

believing they put no personal data about themselves online.

0:23:040:23:08

Could we find anything about them?

0:23:080:23:10

You've got my mobile number in there.

0:23:110:23:13

Which I'm a bit surprised about but I'm guessing that might come

0:23:130:23:15

from a shopping website or something like that.

0:23:150:23:17

It was from somewhere that you'd published it.

0:23:170:23:19

But it's not just his mobile that's public.

0:23:190:23:23

Do you use that e-mail address for resetting certain accounts?

0:23:230:23:26

Uh...yes.

0:23:260:23:27

I think when I said I was green

0:23:270:23:29

I'd forgotten how I put some of those things in.

0:23:290:23:31

It's amazing how much this information just stays there.

0:23:310:23:33

Absolutely.

0:23:330:23:34

There's actually an astonishing number of cases

0:23:340:23:37

where people thought they were really, really secure,

0:23:370:23:39

and gave nothing away, but in reality,

0:23:390:23:41

posted an awful lot of information online.

0:23:410:23:43

And even for those in the red group, who knew they had data online,

0:23:430:23:47

there were still surprises.

0:23:470:23:49

Under my name there's only my phone number and my e-mail address.

0:23:490:23:51

Whereas with the other guys it's their full home details,

0:23:510:23:55

addresses, many phone numbers for them.

0:23:550:23:58

This isn't about you being careless with your data.

0:23:580:24:01

This is about someone else being really careless with your data,

0:24:010:24:05

and all those other coaches, and the names of those children.

0:24:050:24:09

All the names just listed out there,

0:24:090:24:11

it's kind of a shocking thing to see.

0:24:110:24:14

In fact, all of our groups were quite shocked.

0:24:140:24:17

Can anybody who is not a member of this website just access it,

0:24:170:24:21

or do you have to, you know, become a member of it?

0:24:210:24:23

It's all accessible. We didn't register to get that.

0:24:230:24:26

That's good then(!)

0:24:260:24:29

So what, do you think, constitutes safe online behaviour?

0:24:290:24:32

So, firstly, don't be too paranoid.

0:24:320:24:35

I use Twitter, I use LinkedIn,

0:24:350:24:37

I enjoy online services.

0:24:370:24:39

But we have to think a little carefully

0:24:390:24:41

about the information we upload.

0:24:410:24:42

Do we want to give away the location of this photo, in our back garden,

0:24:420:24:47

that contains the location of our house, plus or minus 10-15 metres?

0:24:470:24:50

Secondly, consider lying online.

0:24:500:24:54

Now, I know that sounds like a strange thing to say

0:24:540:24:56

in the real world, but when a service provider says,

0:24:560:24:58

"What's your date of birth?",

0:24:580:25:00

don't tell them, and if they demand you give them that information,

0:25:000:25:04

give them a fake answer.

0:25:040:25:06

Keep note of that for future purposes, for a reset,

0:25:060:25:08

but don't tell them the truth.

0:25:080:25:10

And remember, once something's on the internet,

0:25:100:25:13

you really can't delete it.

0:25:130:25:15

So think before you put anything there in the first place.

0:25:150:25:18

So we should be wary about any information

0:25:180:25:21

that's out there and unrestricted.

0:25:210:25:23

But as Liz is finding out, Big Data's offering up

0:25:230:25:26

more than just new ways to reveal our identity.

0:25:260:25:29

It's also offering up

0:25:290:25:30

a new generation of facial recognition techniques

0:25:300:25:33

that eventually, may even be able to tell how we're feeling.

0:25:330:25:37

Two dimensional facial recognition systems have a wide variety

0:25:370:25:41

of very useful applications, but they're not completely foolproof.

0:25:410:25:45

If I hold up a picture... of Jem Stansfield's head,

0:25:450:25:53

because the system only analyses in two dimensions,

0:25:530:25:56

this flat picture can fool it into thinking

0:25:560:25:59

I'm someone completely different.

0:25:590:26:01

2D systems mostly work by measuring the distance

0:26:020:26:05

between your key facial features.

0:26:050:26:07

But the technology can be easily confused.

0:26:070:26:10

Here at the Centre for Machine Vision

0:26:100:26:13

at the Bristol Robotics Laboratory,

0:26:130:26:15

Mark Hansen and his team have made a system

0:26:150:26:18

that can see in 3D, like we can.

0:26:180:26:19

This booth uses a high speed camera,

0:26:210:26:24

and five near-infrared flashes to build up a 3D likeness of my face.

0:26:240:26:30

So it's captured all the images.

0:26:300:26:31

God, I look hideous!

0:26:310:26:33

That's an awful photograph!

0:26:330:26:36

That is a 3D image of my face, and it's saying, "Access denied."

0:26:360:26:39

Why is it not recognising me?

0:26:390:26:41

Because we haven't enrolled you on the system yet.

0:26:410:26:45

I walk through a few more times and Mark programmes the computer

0:26:450:26:48

to recognise the face its detecting as mine.

0:26:480:26:52

Yay! Good afternoon, Liz. Excellent.

0:26:520:26:56

We're extracting the key features of your face...

0:26:560:26:59

The height of my cheeks, the bump on my nose,

0:26:590:27:01

is that way it all boils down to?

0:27:010:27:03

Absolutely, yep.

0:27:030:27:05

This is Big Data facial recognition,

0:27:050:27:07

matching patterns captured across my entire 3D image,

0:27:070:27:11

with what it has already learned about my face.

0:27:110:27:14

It's more robust than 2D systems,

0:27:140:27:16

and you'd need a twin,

0:27:160:27:17

or a 3D print-out of someone's head, to fool it.

0:27:170:27:21

But the most advanced on display today isn't 3D.

0:27:210:27:25

It's actually 4D.

0:27:250:27:26

This system can process my reactions to a series of YouTube clips,

0:27:270:27:31

in real-time, guessing what I'm feeling.

0:27:310:27:34

These kinds of technologies aren't ready to leave the lab quite yet,

0:27:360:27:40

but this is how robots could see us in the future,

0:27:400:27:43

and identify what we are thinking.

0:27:430:27:46

Whoa!

0:27:460:27:48

And you can't help but imagine what else might be on the horizon.

0:27:480:27:52

Can Big Data predict the future?

0:27:540:27:57

This may seem a little far-fetched,

0:27:570:27:59

but in many ways it's already happening.

0:27:590:28:01

Police forces in the UK are trialling the use of data,

0:28:010:28:05

like weather forecasts and records of break-ins,

0:28:050:28:08

to predict where the next crimes might happen.

0:28:080:28:11

And online retailers are planning to pre-package our goods

0:28:110:28:14

before we've even ordered them.

0:28:140:28:16

Whether we like it or not, Big Data is here,

0:28:160:28:19

and it's going to change our world

0:28:190:28:21

in ways we could never have imagined.

0:28:210:28:23

Next week on Bang Goes the Theory, we look at the science of ageing.

0:28:250:28:29

And we'll be joined by Sir Terry Wogan.

0:28:290:28:32

Meanwhile, if you fancy working in Big Data,

0:28:320:28:35

check out our careers guide at bbc.co.uk/bang.

0:28:350:28:37

And for information on keeping your data secure,

0:28:400:28:43

follow the links to the Open University website,

0:28:430:28:46

and play their interactive privacy game.

0:28:460:28:48

Download Subtitles

SRT

ASS