Technology + Creativity at the 麻豆约拍 Feed Technology, innovation, engineering, design, development. The home of the 麻豆约拍's digital services. 2021-09-14T12:07:03+00:00 Zend_Feed_Writer /blogs/internet <![CDATA[Introducing machine-based video recommendations in 麻豆约拍 Sport]]> 2021-09-14T12:07:03+00:00 2021-09-14T12:07:03+00:00 /blogs/internet/entries/f74ea410-5ec4-4add-9daa-a29d25176ccd Robert Heap <div class="component prose"> <p>From this week we are adding a new feature to our short form video pages on the 麻豆约拍 Sport website.</p> <h4>Related clips</h4> <p>On every video page in 麻豆约拍 Sport you鈥檒l see a related links section. This is usually put together by our editorial colleagues, a routine task which can be time consuming. They have good knowledge about related content, but cannot know about everything, which means that the audience do not see some content that might be relevant.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p09vxrj6.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p09vxrj6.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p09vxrj6.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p09vxrj6.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p09vxrj6.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p09vxrj6.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p09vxrj6.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p09vxrj6.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p09vxrj6.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <h4>Short Form Video & Datalab</h4> <p>With this in mind we have worked with Datalab (our in-house 麻豆约拍 machine learning specialists) to create an algorithm-based video recommendations engine which we hope will help our audience see more of the content they love whilst reducing the editorial overhead of creating a set of relevant links.</p> <h4>Algorithm-based recommendations</h4> <p>The engine works by combining content information about the clip with more information about user journeys from across the 麻豆约拍. This combination of multiple sources should provide a more relevant list of videos for our audience to watch next. This is the first cross product engine supporting user journeys across News and Sport, which means that you may see news, sport or a combination of both in the module. This is the first version of the short form video recommender, there will be more improvements to come, as we continue to develop it.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p09vxz05.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p09vxz05.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p09vxz05.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p09vxz05.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p09vxz05.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p09vxz05.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p09vxz05.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p09vxz05.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p09vxz05.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <h4>Launching the new recommendations</h4> <p>The plan is to launch this new functionality on all American Football clips the week commencing 13 September to give the engine a trial with freshly published content and to give us the opportunity to measure its impact. Provided all is well, we will gradually release this feature across all 麻豆约拍 Sport videos. After that we will begin to roll out the same engine for 麻豆约拍 News. Beyond that we will continue to work with editorial colleagues to improve it over the coming months.</p> <p>If you have any feedback on this new video experience, please leave your comments below.</p> </div> <![CDATA[Philip 21 - an interactive story exploring race, love and modern Britain]]> 2021-07-20T13:12:44+00:00 2021-07-20T13:12:44+00:00 /blogs/internet/entries/7007a13e-2c58-4011-9d38-bfeef2c56cbc Joey Amoah <div class="component prose"> <p><a href="/taster/pilots/philip-21">Philip 21</a> is a brand new narrative <a href="/rd/object-based-media">object-based media (OBM)</a> experience from <a href="https://www.halucid.co.uk/">halucid_ theatre company</a> that takes the premise of a date with a young black man and turns it into an introspective examination of race, love and modern Britain. In this blog post, we look at the techniques and mechanics that underpin this and other branching narrative experiences, examining how they keep audiences engaged compared to traditional media.</p> <p>In this regard, two specific areas of the project need to be examined; the fabula and syzhuet of the authored experience; and the dual narrative created by having the outcome of the story that each audience member sees be dependent on the choices they make.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p09pvgrn.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p09pvgrn.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p09pvgrn.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p09pvgrn.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p09pvgrn.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p09pvgrn.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p09pvgrn.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p09pvgrn.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p09pvgrn.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <ul> <li> <h2><a href="/taster/pilots/philip-21"><strong>麻豆约拍 Taster - Try Philip 21</strong></a></h2> </li> </ul> <p>The 'fabula' is a literary term used to refer to the raw material of a story, what takes place and the chronology of events. It describes the skeleton of the experience, story beats and the audience journey. The 'syzhuet' describes how the story is organised and presented to audiences. It covers everything from the perspective the story is told from to the arrangement of actors and the cinematography of a scene.</p> <p>In traditional media, the audience is passive and cannot interact with what is being shown. Storytellers may play with the audience journey and how the story is presented, but they never surrender control of the constituent elements, and they do not offer alternative and equally valid branches. halucid_ have had to wrestle with the challenge of making the fabula and syzhuet work together while also giving up a degree of creative control.</p> <h2>The fabula and syzhuet of the authored experience</h2> <p>In the case of Philip 21, the fabula could be described as being broadly linear. The audience arrives for a date with Philip; they engage in conversation with him and are ultimately asked if they would like a second date or not. However, this simple sequence is not what is received by the audience. What does take place is a fragmented series of scenes where progress can only be made through user choices and engagement. halucid_ plays with the fabula and utilises the narrative setting, the conversational structure and the first-person perspective to drive the experience forward. Philip 21 takes the narrative setting, that of a date, and uses it to establish the boundaries of the world and to inform us on how we should behave. Since many of us will be familiar with this experience from our own lives, halucid_ leans on the understanding of social norms to get audiences to participate in the way that they desire. Building on this, halucid_ uses the codes and conventions of conversations to create an internal metre that demands our engagement. It is only through responding to Philip鈥檚 questions and internalising his responses that the narrative can advance. In so doing, it bids the viewer to suspend their disbelief and enter into the story world. Finally, the first-person perspective means that the audience is always focused on the subject, unable to look away, and the fixed camera position creates a sense of intensity, intimacy and immediacy, which is further heightened by the one-to-one interaction enjoyed with Philip.</p> <p>These creative decisions help move the narrative forward and straddle the line between the fabula and how the authored experienced is presented. This is particularly important to the narrative OBM experience because both content creators and audiences are jointly responsible.</p> <p>Philip 21 can be navigated in several different ways, with each choice offering a different route through the experience. These routes have been created by halucid_, but audiences have the freedom to select which paths to follow. They can choose a path from the outset or change course at any point, meaning that the syzhuet presented is unique to each viewer. Philip 21 has no primary path, and as a result, all routes through the experience and all outcomes received are equally valid.</p> <h2>The dual narrative</h2> <p>The second interesting element worthy of discussion about Philip 21 is how it goes about creating a dual narrative. On the one hand, Philip 21 is an authored experience, a story that halucid_ is seeking to tell, and on the other, there is the audience experience and what the audience bring and take away from the experience. The audience is integral to delivering Philip 21, and by taking part, they are positioned not as passive watchers but are co-protagonist alongside Philip. The viewer becomes a character in the story and must decide how to approach the situations in the story.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p09pyb3f.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p09pyb3f.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p09pyb3f.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p09pyb3f.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p09pyb3f.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p09pyb3f.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p09pyb3f.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p09pyb3f.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p09pyb3f.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""><p><em>(Jason Dodd Photography)</em></p></div> <div class="component prose"> <p>The viewer must decide if they will approach the work sincerely as their authentic self or assume a persona/play a role that goes against type. This choice, whether conscious or unconscious, determines where the story goes. This way, what is presented back to the audience is a reflection of choices made. Philip 21 could be considered as having two narratives. The authored story and one reflected to the audience based on how they interact with the experience.</p> <h2>What comes next</h2> <p>Narrative OBM experiences are still in their infancy, and we are only beginning to understand the impact of content creators giving over control to audiences. Altering the syzhuet of a story presents a wide array of creative opportunities for creators and audiences, but these choices will need to work in tandem with the fabula and not distract from the story. We are unsure of what works best and are eager to see further experiments in this area.</p> <p>Likewise, the interplay between the authored experience and the one reflected back at audiences is something we are keen to explore in more detail. What would happen if additional choices were offered to audiences? What would occur if a story was told episodically and not in one session? How would audiences experience this? The only way to find out is to build and test these kinds of experiences.</p> <ul> <li><strong><a href="/makerbox/tools/storyformer">麻豆约拍 Makerbox - StoryFormer</a></strong></li> <li><strong><a href="/rd/blog/2018-10-object-based-media-tools-storyformer">麻豆约拍 R&D -聽StoryFormer: Building the Next Generation of Storytelling</a></strong></li> <li><strong><a href="https://www.youtube.com/watch?v=oLWp46ZsI5U">YouTube - 麻豆约拍 R&D Explains: How to make a personalised story</a></strong></li> </ul> </div> <![CDATA[Building a WebAssembly Runtime for 麻豆约拍 iPlayer and enhanced audience experiences]]> 2021-03-01T10:19:41+00:00 2021-03-01T10:19:41+00:00 /blogs/internet/entries/39f42525-77db-43b0-81bb-70a0d5b1f062 Juliette Carter <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p09ffmq9.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p09ffmq9.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p09ffmq9.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p09ffmq9.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p09ffmq9.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p09ffmq9.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p09ffmq9.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p09ffmq9.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p09ffmq9.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p>At <a href="/rd">麻豆约拍 Research & Development</a>, we are investigating how we evolve our current multimedia applications to move beyond video by using <a href="/rd/object-based-media">object-based media (OBM)</a>. OBM allows us to develop future audience experiences which are immersive, interactive and personalised.</p> <p>There is an ever-increasing number and range of audience devices capable of playing back OBM experiences. The challenge we now face is universal access - How can we get all members of the audience to enjoy OBM experiences on any device, and how do we do this sustainably and at minimal cost?</p> <p>Our <a href="/rd/projects/render-engine-broadcasting">Render Engine Broadcasting (REB)</a> project is investigating new technologies that will allow the 麻豆约拍 to deliver these OBM experiences at scale to all of our audiences, no matter what device they use. Our ultimate goal is to deliver real-time and fully rendered experiences on any device or platform and write the software to do it only once. We have been investigating the use of WebAssembly as a cross-platform technology for this.</p> </div> <div class="component prose"> <p><strong>What is WebAssembly?</strong></p> <p><a href="https://webassembly.org/">WebAssembly</a> (wasm) is a Universal Binary format designed as a sandboxed environment and a portable compilation target, which means that the same wasm module can run securely on multiple platforms. A number of strongly typed languages such as C/C++, Rust or AssemblyScript can compile to WebAssembly, making it language agnostic. This makes it an attractive option for adoption in the industry as it enables developers to use languages they already know to produce wasm binaries.</p> <p>When WebAssembly was first developed a few years ago, its target platform was the web. The aim was to compile fast and efficient system-level code and have it run in the browser. Compute intensive applications, such as real-time interactive rendered graphics, could be run in a web browser at near-native performance. This also enabled some native applications to be ported to the web, increasing their reach and usage. These include <a href="https://www.google.com/earth">Google Earth</a>, which renders 3D representations of satellite image in the browser, and <a href="https://en.wikipedia.org/wiki/AutoCAD">AutoCad</a>, which now offers a WebApp to create and edit CAD drawings.</p> <p>In the last couple of years, WebAssembly outside of the browser has been gaining traction. A number of native wasm runtimes have been developed, which has enabled the use of WebAssembly for microservices and server applications. In 2018, the website security company <a href="https://blog.cloudflare.com/webassembly-on-cloudflare-workers/">Cloudflare</a> announced the use of WebAssembly on their edge workers, allowing users to deploy secure and fast serverless code compiled to wasm. And the edge cloud platform provider聽<a href="https://www.fastly.com/blog/announcing-lucet-fastly-native-webassembly-compiler-runtime">Fastly</a>聽offers new wasm-based edge computation using their native runtime Lucet.</p> <p>The portability of WebAssembly across multiple platforms and its security model are the key reasons for 麻豆约拍 R&D鈥檚 interest in using this technology as a compilation target for media experiences. As a public service broadcaster, we need to deliver value to all of our audiences, regardless of the device they use. Where traditionally, a codebase for each target platform and a different team to maintain each codebase would be required, the use of WebAssembly potentially allows for a much more sustainable developer ecosystem. It enables media software applications to be created once, from a single codebase, compiled to WebAssembly and deployed on any client or server platform depending on the capabilities required. It also offers numerous advantages compared to previous multimedia or cross-platform technologies (such as Flash or Java Runtime Environment). Indeed, it is language agnostic, security-focused, has predictable performance, and works inside and outside the browser. WebAssembly is also an open standard, which encourages its adoption.</p> <h4>How have we used WebAssembly?</h4> <p>We wanted to demonstrate how we could use WebAssembly to deliver media experiences that can run on many target platforms built from a single codebase. To do that, we implemented an example media application written in C++, which we compile to WebAssembly, giving us a wasm module. We designed this application to look like a version of 麻豆约拍 iPlayer, allowing users to select content, watch video programmes, AND play OBM experiences. We call this application the Single Service Player (SSP).</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p09ffygy.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p09ffygy.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p09ffygy.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p09ffygy.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p09ffygy.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p09ffygy.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p09ffygy.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p09ffygy.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p09ffygy.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""><p><em>An example of how object-based media experiences could appear within 麻豆约拍 iPlayer.</em></p></div> <div class="component prose"> <p>To run our SSP wasm module, we needed a wasm runtime. The SSP makes use of some low-level media functionality, which isn鈥檛 scoped by the WebAssembly specification. To enable wasm modules to make use of these low-level multimedia capabilities, they need to be implemented in the runtime and made available to the wasm module through a set of imports. Examples of such capabilities include:</p> <ul> <li>Windowing and rendering 鈥 In most cases, a multimedia application will have some graphical elements to it, which requires things to be drawn in a window (such as video frames or a UI screen).</li> <li>User inputs 鈥 An interactive multimedia experience expects user inputs, such as keyboard or mouse events.</li> <li>Media encoding and decoding 鈥 To efficiently encode and decode media (such as video frames or audio packets), it is preferable to use the host鈥檚 hardware resources where possible.</li> </ul> <p>As there is currently no WebAssembly runtime that offers these media capabilities, we've decided to create our own.</p> <p>There are already some efforts in specifying ways a wasm module can talk to the host. <a href="https://wasi.dev/">WASI (the WebAssembly System Interface)</a> proposes a set of standardised POSIX-like syscalls (the programmatic way in which a computer programme communicates with the host system) for libc functionality, mainly file handling and networking. These are called from the wasm module and implemented in the runtime.</p> <p>We decided to use a similar approach to allow our SSP wasm module to communicate with the host, enabling it to have access to low-level media functionality. This involved identifying all the platform-specific media capabilities that could not be compiled to wasm and implementing them in the runtime. These capabilities were then made accessible to the wasm module through a set of platform-independent syscalls passed as imports.</p> <p>This figure illustrates the whole process, from writing a media experience as software (such as the SSP) to running it as a wasm module on any device. The steps are detailed below.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p098btpl.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p098btpl.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p098btpl.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p098btpl.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p098btpl.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p098btpl.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p098btpl.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p098btpl.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p098btpl.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p>The first step was to design the multimedia sys-call API behind which we would implement our cross-platform multimedia capabilities in the runtime. It needed careful consideration to ensure it was thread-safe and honoured the wasm security requirements around memory access. In the figure above, we use reb_decode_video() as an example syscall, which our SSP application can use to access low-level multimedia functionality, such as utilising the system鈥檚 hardware for video decoding.</p> <p>Our SSP code was compiled to wasm using the clang compiler and the wasi-sdk toolchain, and the required syscalls are added as imports to the wasm module.</p> <p>We then built the multimedia wasm runtime, consisting of two parts. The first one is the execution environment for wasm modules, which allows us to load and run a wasm module. For this, we embedded <a href="https://wasmtime.dev/">Wasmtime</a>, a <a href="https://bytecodealliance.org/">ByteCode alliance</a> project based on <a href="https://github.com/bytecodealliance/wasmtime/tree/main/cranelift">Cranelift</a>, which generates the machine code for the target platform from the wasm binary.</p> <p>The second part of our runtime is the implementation of the low-level multimedia functionality. For this, we created a cross-platform C++ library with input detection, networking, windowing, graphical rendering, and media decoding, which sits behind our carefully designed syscall APIs. We compiled our library for several target platforms, such as Linux, macOS, Windows, Raspberry Pi and Android. We also wrote some glue code to connect the two parts.</p> <h4>Where do we go from here?</h4> <p>A wasm runtime capable of executing multimedia applications opens a lot of possibilities, principally around flexible compute. Flexible compute allows us to run computationally demanding applications by dividing up the workload between available resources. These resources could be located locally (a laptop, games console or phone in your house), in the edge, or the cloud.</p> <p>As we move towards delivering fully rendered real-time interactive experiences, the flexible compute approach becomes an attractive solution to the computational demands of such applications. We could, for example, consider segmenting a rendered frame into several tiles or objects, each of those rendered on a separate available compute resource. Many systems approach this problem by running specific compute tasks in containers across the available devices and platforms. We hope to use our work and accrued knowledge in developing the wasm multimedia runtime to investigate a viable alternative to the container approach for distributed media applications. We are looking into using wasm modules to perform secure and fast computation on any remote compute nodes.</p> <p>Our runtime, capable of performing media services such as rendering and decoding or encoding of rendered video frames, can be used to display the final experience to the user on a client device and to execute the remote computational tasks as wasm modules. Using WebAssembly combined with a flexible compute approach, we hope to develop technology that allows the audience to access any future experience, regardless of their devices at home.</p> </div> <![CDATA[The complexities of creating a new 'follow topic' capability]]> 2021-01-13T13:49:58+00:00 2021-01-13T13:49:58+00:00 /blogs/internet/entries/c80f349a-cec8-4f6b-9afe-913b8dee752d Dave Lee <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p093wjx9.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p093wjx9.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p093wjx9.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p093wjx9.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p093wjx9.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p093wjx9.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p093wjx9.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p093wjx9.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p093wjx9.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""><p><em>Exploring how the visual emphasis of our Follow UI works alongside other calls to action.</em></p></div> <div class="component prose"> <p>The 麻豆约拍 is committed to creating a truly personalised experience for everyone. We believe that a more personal 麻豆约拍 helps us build a deeper connection with our audiences, and helps us to extract more value from online. We have so much to offer across our products and services and one of our challenges is to help our audience discover the content we have.</p> <p>We have a number of approaches to achieve this but alongside automated personalisation like smarter recommendations, we want to enable our audience to tell us what they want to see more of. So, we鈥檙e exploring the idea of enabling audiences to follow topics, a new capability that could work as a service for all of our teams and products.</p> <p>This consistent and collaborative approach is not only more efficient, but would allow users鈥 preferences to follow them wherever on the 麻豆约拍 they are. This blog posting will try to convey the complexities around creating this new follow capability.</p> <h4>Context</h4> <p>The 麻豆约拍 creates thousands of different content items on a daily basis, ready to be consumed by people in the UK and all around the world. But how might we improve the discoverability of that content? How might we offer new content to those who are interested in it?</p> <p>The 麻豆约拍 麻豆约拍 team started working on the idea of 鈥榯opics鈥 in summer of 2019. Working with colleagues from around the 麻豆约拍, we have developed the technology to present a Pan-麻豆约拍 topic to the audience.</p> <p>Very early on in the work on topics it was recognised that there was an opportunity to harness the power of topics by allowing the audience to choose which topics they personally find interesting so we can personalise their experience of the 麻豆约拍 and notify them when relevant content is available. Examples of this already exist with the 麻豆约拍 such as the My Sport functionality in the 麻豆约拍 Sport app; subscribing to podcasts in 麻豆约拍 Sounds and adding a programme to your favourites in 麻豆约拍 iPlayer.</p> <p>The prior implementations have been undertaken in a product-specific fashion, but we are building 鈥檉ollow鈥 in a unified way, aiming for a seamless audience and technical approach across all products. Behind the scenes, the mechanism for all these types of 鈥榝ollowing鈥 is common. How might we create an experience which harnesses all these 鈥榝ollow鈥 events to aid in content discovery by putting new and relevant content to our audiences quickly and efficiently?</p> <h4>Principles</h4> <p><a href="/blogs/internet/entries/8673fe2a-e876-45fc-9a5f-203c049c9f9c">For further background, read Matthew Clark鈥檚 blog post</a>.</p> <p><strong>Platform Consistency</strong></p> <p>All new developments will occur within the 麻豆约拍鈥檚 existing web technology platform unless it is not technically possible. WebCore utilises a mono-repository and because all the teams can access everyone鈥檚 code, it becomes much easier to be consistent across all our teams鈥 work. The graphical components are all stored in a central repository and are available to all teams to utilise and integrate into their own developments.</p> <p><strong>Reuse</strong></p> <p>All existing solution building blocks need to be taken into account and reused where applicable. If anything new is developed, it must be straightforward for other teams to utilise or develop further to meet their own needs.</p> <p><strong>Collaborate</strong></p> <p>Collaboration with other teams is key to this project鈥檚 success. This work is being carried out within a complex organisation and as such, numerous teams and colleagues will be stakeholders in it. Openness and a readiness to collaborate will pay dividends later.</p> <h4>The Key Considerations and Complexities:</h4> <p><strong>What concepts should the audience be able to follow?</strong></p> <ul> <li>We have chosen to do this, initially, at the topic level. For example, audience members will be able to follow their favourite musical artists, news correspondents or even locations. As topics are representations of the concepts that the 麻豆约拍 is creating content on, this feels like a more natural fit given that new content will keep being added to inform audiences on. While standalone content items can be revised (often with minor or major edits) after publication or when the story develops, this feels like too granular a level for a user to specifically follow, making a save or read later function more appropriate. To 鈥楩ollow鈥 an article, for example, is more of a 鈥榬ead later鈥 function.</li> </ul> <p><strong>We need to understand how a 鈥榝ollow鈥 button should operate and how it changes state based on the current context it鈥檚 displayed within.</strong></p> <ul> <li>We need to consider if the audience member is already following a topic or not, if we need to offer the ability to unfollow that topic or not.</li> <li>We also need to consider preferences we鈥檝e already been given for other follow-like implementations and if it鈥檚 appropriate or not to use that data.</li> <li>It might also be applicable to present a follow count to indicate how popular a particular topic might be.</li> <li>We will also make the button mechanism available to our app ecosystem to keep the mechanism consistent across all our platforms.</li> <li>We need to build the logical components in such a way that they will operate in the same way regardless of where the follow button is presented.</li> <li>For example, if the follow button is offered on a Pan-麻豆约拍 topics page like <a href="/topics/c45r8dl5xxnt">Manchester</a> or <a href="/news/topics/cezlkpjzx2jt">Mark D鈥橝rcy</a> we must ensure the behaviour is consistent and intuitive.</li> <li>How will it work within accessibility guidelines?</li> <li>How will it work for our apps?</li> </ul> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p093wjjb.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p093wjjb.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p093wjjb.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p093wjjb.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p093wjjb.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p093wjjb.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p093wjjb.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p093wjjb.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p093wjjb.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p><strong>Legal constraints and how this will operate within the law</strong></p> <ul> <li>Considering GDPR and previous preferences provided to us by our audience members, can this new capability function within the law and if not, what must we do to ensure we are legally compliant and transparent to our audience?</li> </ul> <p><strong>How will this operate within the overall ecosystem of the 麻豆约拍 Online estate?</strong></p> <ul> <li>The 麻豆约拍 Online estate is complex and ever shifting. We must ensure we abide by open standards and reuse existing services to ensure a good fit today and for the future. Also, we must collaborate with teams internally to ensure our internal stakeholders are fully aware of our progress.</li> </ul> <p><strong>How will we ensure we don鈥檛 create echo chambers for audiences?</strong></p> <ul> <li>We need to carefully consider how we present followed content, so that it is presented coherently alongside both recommendations and related content.</li> <li>Followed content will be shown next to editorially curated selection, therefore, we will also need to consider how we deduplicate and prioritise content.</li> </ul> <p><strong>How do we maintain adequate levels of data to be functional and keep that minimal dataset secure?</strong></p> <ul> <li>We have a very robust data security process to satisfy. At every design step, security of the data is a key requirement.</li> </ul> <p><strong>What data do we need to record to adequately record a Follow action?</strong></p> <ul> <li>This data must then be easily parsed and follow our own internal standards and schemas.</li> <li>This data format must be compatible with the User Activity Service 鈥 our centralised location for storing user activity data 鈥 and be registered with it.</li> </ul> <p><strong>Where is the result of following one or more topics displayed? i.e. how are followed topics surfaced?</strong></p> <ul> <li>This is a key question for any product team. The 麻豆约拍page team will have different needs than for example, the Sport team. But, the underlying system must be able to cope with this in as an efficient manner as possible.</li> <li>A key consideration is how much should we develop for each delivery? We could work for a long period and deliver in a more big-bang approach, or does it make more sense to develop in small iterations accepting a changing interface and functionality to our audience?</li> </ul> <p><strong>How will this level of personalisation work within the WebCore stack鈥檚 caching strategies?</strong></p> <ul> <li>This is less of a concern for our team as we鈥檒l be using the outcome rather than heavily contributing to it. However, it is of great importance and so we will keep in close touch with the team working on it to ensure no assumptions are made about what is needed.</li> </ul> <p><strong>Do we have the technical capability to process a potentially complex set of follow activities?</strong></p> <ul> <li>Our 麻豆约拍 麻豆约拍 Data Capabilities Team is developing specific technology based on AWS ElasticSearch for this very purpose.</li> <li>This system will capture metadata about our content from various internal sources to allow us to query that metadata in novel ways to find the content we need on a per user basis.</li> </ul> <h4>How will the resulting follow actions be surfaced?</h4> <p>The Follow Product and User Experience team have broken down the approach into four separate phases:</p> <ol> <li>Audience members will be able to see what topics they鈥檙e currently following, follow new ones and choose to unfollow others from a topic index page or the 麻豆约拍 homepage.</li> <li>Our audience members will be able to see the topics they have followed across the News and Sport websites and apps. This will come with the new ability to manage what they have followed across the 麻豆约拍.</li> <li>Also, our audience will get the ability to follow a topic from the content level i.e. from within an article.</li> <li>Follow will become a unified approach across the 麻豆约拍 online estate helping breakdown any barriers between content and aiding discovery for all audience members.</li> </ol> <p>One outlet for follow is in alerts and notifications. There is a team working hard on understanding how best to achieve this whilst offering the most value to our audience. It鈥檚 a complex area so we鈥檙e keeping close on that work to ensure we鈥檙e ready for it.</p> <h4>Conclusion</h4> <p>It is early days for 鈥榝ollow鈥, but the ground work is already in place for it to be a great success. The 麻豆约拍 topics product is maturing and is already powering the majority of the Sport website and app. It also powers the 麻豆约拍 homepages and we鈥檙e currently working closely with the News team to help them bring the 麻豆约拍 News topics up to date and into the WebCore world.</p> <p>We鈥檝e detailed our key principles and considerations around the follow action and how we plan to implement it. It鈥檚 a challenging area to work within but all the building blocks are sliding into place, and we鈥檙e starting to look at how and when we can roll out the first phase. Following a topic is something people understand, as they鈥檙e used to doing it on social media.</p> <p>We鈥檙e working hard on bringing this functionality to our audiences as soon as we can make sure it meets the level of quality our audience expects from the 麻豆约拍. We aim to bring together all our content to all our audiences in such a way that the traditional silos of content are blurred to the point of invisibility.</p> </div> <![CDATA[Me, you and the machine]]> 2020-07-20T09:24:37+00:00 2020-07-20T09:24:37+00:00 /blogs/internet/entries/b2106d78-762c-403a-901a-2e34894c3ac1 Matthew Postgate <div class="component prose"> <p>We鈥檙e relying on a wide set of actions and tools to help us deal with the current pandemic. The 麻豆约拍 is playing its part to <a href="/mediacentre/latestnews/2020/bbc-keeping-nation-informed-educated-entertained">inform, educate and entertain</a>. And for us and others, digital technologies are playing a key role. In this blogpost, I discuss the 麻豆约拍鈥檚 approach to one of most important set of digital tools: 'machine learning'.</p> <p>The term machine learning (ML) covers a range of computer systems which learn from experience. With Covid-19, we know ML techniques are being used for contact mapping and predicting the effectiveness of drugs.</p> <p>One reason ML is being deployed here is that it is being deployed everywhere. Tools that can be trained on vast data sets and learn and improve as a result are behind social media feeds, computer vision and robotics, financial and weather models, and of course the improved machine translation and voice recognition systems that many of us use every day.</p> <p>Many of these areas are directly relevant to the 麻豆约拍 and its day-to-day operations. The Design and Engineering division I lead has been looking at them closely for some time, exploring ways in which machine learning can help us to enhance what the 麻豆约拍 offers our audiences.</p> <p>We believe that ML can help us respond to audience expectations, especially from 鈥榙igital native鈥 younger audiences. A key area is content discovery and recommendations. Audiences no longer accept having to put significant effort into searching for what they want. They want a personalised offer, which feels both relevant and fresh 鈥 something ML can help us to provide.</p> <p>And ML can help us innovate. There is potential to transform the ways we make programmes, the way we run as a business, and of course the ways we do our journalism. Examples include speeding up <a href="/rd/blog/2020-04-machine-learning-video-coding-optimisation">video compression</a> or finding ways of detecting and flagging disinformation.</p> <p>It's not surprising that we should be looking at ML in this way: the 麻豆约拍 has always worked with new technologies to offer the best user experience we can. This is why we created iPlayer and Sounds, and developed approaches like our Global Experience Language (<a href="/gel">GEL</a>), the 麻豆约拍鈥檚 shared design framework. As ML has developed, we have started to explore how to use the technologies responsibly and efficiently. We have also developed <a href="/blogs/internet/entries/4a31d36d-fd0c-4401-b464-d249376aafd1">a set of principles governing our deployment of ML technologies</a>.</p> <p>I want to be clear about where our ambitions lie. We are not Microsoft, Google or Baidu. We don't have their amounts of data, money or computing power. We are not aiming to compete with them by developing our own machine learning frameworks, or performing advanced research in novel algorithms.</p> <p>But the 麻豆约拍 is a fertile environment for applying ML techniques. We have unique types of problems to solve, and we have the ability as an organisation to draw from almost one hundred years of experience in storytelling. We are ambitious in the desire to explore the positive impact of applying ML to our operations.</p> <h4>What does this mean in practice?</h4> <p>The first thing we think through is whether a ML solution is needed. We then assess the <strong>benefits</strong> of each application to both individuals and society. An example would be designing the 麻豆约拍鈥檚 content recommendation engines to broaden our audience鈥檚 horizons. This is because we think there is both individual and public value in discovering new perspectives, music or experiences 鈥 not simply finding more of the same.</p> <p>We also ensure that we use our resources <strong>efficiently.</strong> ML requires a solid data platform and a consistent and modern approach to experimentation across our portfolio of products and services. It is important to maintain a central and coordinated approach so that, as an organisation, we can deploy scarce capability in the most effective way and optimise on learning quickly.</p> <p>We pair our Machine Learning capabilities with <strong>human judgement and diversity of experience</strong>. This applies both from a technology development perspective - where we bring together technical experts (e.g. data scientists, UX designers, product specialists) with editorial, policy, legal and R&D colleagues, and in terms of our audience experience - where the 麻豆约拍鈥檚 automated curation will sit alongside human curation.</p> <p>Finally, we recognise the need for <strong>collaboration and co-operation</strong> with other industries and organisations in maturing our approach with ML. Collaborations which allow media and technology companies to bring their expertise together in the public good will create more powerful experiences than anything we can do alone.</p> <p>Machine learning has enormous potential to transform not just the 麻豆约拍 but every other organisation. I want us to use it to connect with people more effectively, to bring out the strengths of our storytelling and to find new ways of communicating our trusted journalism. I hope the power of machines will help me and my colleagues create something new, compelling and distinctively 麻豆约拍 for each member of our audience.</p> </div> <![CDATA[How metadata will drive content discovery for the 麻豆约拍 online]]> 2020-04-15T13:53:12+00:00 2020-04-15T13:53:12+00:00 /blogs/internet/entries/eacbb071-d471-4d85-ba9d-938c0c800d0b Jonathan Murphy and Jeremy Tarling <div class="component prose"> <p><em>Jonathan Murphy, editorial lead for metadata and Jeremy Tarling, lead data governance specialist in Digital Publishing, explain what's being done to create a common metadata structure.聽</em></p> <p>The 麻豆约拍鈥檚 online portfolio has been built up over more than 20 years into a rich and varied collection of websites and services - but with all this content, it鈥檚 sometimes difficult trying to find some of it, let alone manage it all. As a result we have lots of hidden gems that aren鈥檛 being surfaced, and that鈥檚 something that we鈥檙e trying to fix with a new content discovery strategy.</p> <p>One of the challenges we鈥檙e facing up to as we rebuild our digital portfolio is how to make more of our content discoverable and personalised to more of our audiences. That鈥檚 particularly true of the under 35s age group who now have an array of competing platforms which do a great job in building algorithms to attract their attention.</p> <p>Underpinning this strategy of discovery and personalisation will be establishing more detailed metadata that describes all of our content using the same terminology and the same tools and data model.</p> <p>Metadata is the background info that describes the things we make. It can come in all forms, from technical metadata such as which camera was used in a film shoot to promotional metadata used to describe the plot of a programme. For the remit of this project, we鈥檙e focusing on what we call descriptive content metadata - tags that describe what an asset (e.g. an article, programme or TV/Audio clip) is about or who/what it mentions. That鈥檚 already used in areas like the 麻豆约拍 News and 麻豆约拍 Sport websites using data architecture called <a href="/blogs/bbcinternet/2012/04/sports_dynamic_semantic.html">Dynamic Semantic Publishing that was created for the London 2012 Olympics,</a> and now drives many thousands of subject-based aggregations, or Topic pages.</p> <p>There are also <a href="/sounds/categories/mixes">categories of programmes on 麻豆约拍 Sounds</a> and 麻豆约拍 iPlayer, which use a mixture of genres and formats contained in <a href="/blogs/bbcinternet/2009/02/what_is_pips.html">the PIPs database</a> that supports our vast online library of programme information. As a result of these two data silos, and their limitations, it鈥檚 difficult to offer audiences any pan-麻豆约拍 experiences or anything that requires an in-depth understanding of the content.</p> <h4>Common Metadata</h4> <p>So we need to go further than that, in order to offer material that covers both the breadth of our online content and to suit everyone鈥檚 tastes and needs. Here in the 麻豆约拍鈥檚 Digital Publishing team, we鈥檙e developing tools that make content description possible at all stages of production across our portffolio, and new vocabularies that allow for richer descriptions of our content.</p> <p>We鈥檝e already worked with the Sounds team, who have used our new tags and curations to create some of their <a href="/sounds/category/mixes">Music Playlists</a> which give you a soundtrack to suit your mood, whether that鈥檚 鈥榗hilled out鈥 鈥榝eel good鈥, music to dance to, or music to focus your mind.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p089knwq.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p089knwq.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p089knwq.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p089knwq.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p089knwq.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p089knwq.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p089knwq.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p089knwq.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p089knwq.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p>To make this possible we鈥檝e developed a concept that every piece of content has a basic set of common metadata associated with it, that it carries around wherever it鈥檚 surfaced across the 麻豆约拍鈥檚 portfolio - whether that鈥檚 in Sounds, iPlayer or on the 麻豆约拍 News homepage. We鈥檙e storing this set of basic common metadata in what we call a 鈥楶assport鈥, and to create and manage this metadata we鈥檝e developed a tool called Passport Control.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p089kphl.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p089kphl.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p089kphl.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p089kphl.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p089kphl.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p089kphl.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p089kphl.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p089kphl.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p089kphl.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p>Using the simple graph model first developed for the 麻豆约拍鈥檚 2012 Olympic鈥檚 coverage, we create subject-predicate-object triples to describe the nature of the relationship between an asset (the subject) and a tag (the object).</p> <p>In the above example this asset has been described as being 鈥渁bout鈥 some things - this kind of subject based tagging is well established at the 麻豆约拍, especially in our journalism output. But we have added three new predicates: 鈥渆ditorial tone鈥, 鈥渋ntended audience鈥, and 鈥済enre鈥.</p> <p>Each predicate can be used with an associated controlled vocabulary of terms. In some cases these controlled vocabularies are taxonomic hierarchies (like 麻豆约拍 genres) while in others they are simple lists of terms that we have developed to describe our output in ways that make sense to us and our audience.</p> <p>These new types of metadata can be used to make much richer collections of content, either as manual editorial curations or algorithmically generated recommendations.</p> <p>Our colleague Anna McGovern explains further some of those challenges we face at the 麻豆约拍 <a href="/blogs/internet/entries/887fd87e-1da7-45f3-9dc7-ce5956b790d2">here</a> in building our curations and recommendations, building on our public service values. With the amount and variety of material that we produce, from news articles to music mixes, live events to boxsets, we think we鈥檙e in a good position to provide content for all kinds of different tastes.</p> <p>We鈥檒l update you more about metadata developments, curations and recommendations as these features begin to roll out on 麻豆约拍 Online over the coming months.</p> </div> <![CDATA[Understanding public service curation: What do 鈥榞ood鈥 recommendations look like?]]> 2019-12-17T13:24:25+00:00 2019-12-17T13:24:25+00:00 /blogs/internet/entries/887fd87e-1da7-45f3-9dc7-ce5956b790d2 Anna McGovern <div class="component prose"> <p>Unless you know exactly what you鈥檙e after, finding the 麻豆约拍 content that is exactly right for you can be a little like looking for a needle in a haystack. Some challenges we face at the 麻豆约拍 are:</p> <p>1. We produce a vast quantity of content, at a rough estimate, around 2000 pieces of new content a day. In terms of our 鈥榮hop window鈥, we have a good number of promotional areas and slots 鈥 on homepages, in our apps, on our schedules and on our social media feeds 鈥 but simply not enough to accommodate all our content.</p> <p>2. When content makes it to a prominent promotional slot, it will, of course, get good traffic, but there is no guarantee that the person for whom that content is most relevant will see it at the time. And once that content loses its place it will be hard to find, indeed, no one will know it exists unless they are armed with determination and the greatest of digital search skills.</p> <p>3. Colleagues working in curation have an incredible ability to seek out the best content and ensure that it is labelled in a way that maximises its impact, but they simply can鈥檛 read, watch or listen to all of our output. There is just too much of it.</p> <p>So far, so familiar.</p> <p>Personalised recommendations, fuelled by the power of machine learning, are what every forward-thinking media and tech organisation is doing to put the content that audiences would most enjoy right in front of them. The 麻豆约拍 has automatic recommendations in <a href="http://www.bbc.co.uk/sounds">Sounds </a>, on <a href="http://www.bbc.co.uk/iplayer">iPlayer </a>and on some news language services like <a href="https://www.bbc.com/mundo">麻豆约拍 Mundo</a>, which use strategies like content similarity, popularity and collaborative filtering.</p> <p>The curatorial challenge for the 麻豆约拍 becomes more interesting and complex, because our guiding principles relate so strongly to delivering public service value to the audience. We cannot simply work out what would get the most clicks, and show that to our audiences (although ensuring content is attractive to our audiences is important). Instead, we are required:</p> <ul> <li>To provide impartial news and information to help people understand and engage with the world around them</li> <li>To support learning for people of all ages</li> <li>To show the most creative, highest quality and distinctive output and services</li> <li>To reflect, represent and serve the diverse communities of all of the United Kingdom鈥檚 nations and regions and, in doing so, support the creative economy across the United Kingdom</li> <li>To reflect the United Kingdom, its culture and values to the world</li> </ul> <p>These requirements, written into our <a href="http://downloads.bbc.co.uk/bbctrust/assets/files/pdf/about/how_we_govern/2016/charter.pdf%20">Royal Charter</a>聽are why editorial values are woven into our <a href="/blogs/internet/entries/4a31d36d-fd0c-4401-b464-d249376aafd1">Machine Learning Engine Principles</a> and why those working in machine learning <a href="/blogs/internet/entries/a38207dd-e4ed-40fa-8bdf-aebe1dc74c28.">work closely with editorial teams</a>.聽I recently conducted a deep exploration of what public service curation means to an organisation like the 麻豆约拍, so that we can begin to identify some of the signals that can help us as we build next level public service recommendations services.</p> <p>Around 80 editorial staff, involved in some way with digital curation or content creation, took part in the discussion. Around 40 Design & Engineering colleagues observed those discussions, mostly those working in areas related to data science and engineering, metadata and research. Five criteria for public service curation emerged:</p> <p>1. We want our content to reach and engage as large an audience as possible. We have a role in the national conversation by bringing the most important and resonant stories of the day to the attention of our audiences, as well as prioritising content, that is popular and has universal appeal for the greatest number of people. Examples of this might include content that brings people together like Strictly Come Dancing or content of global importance like Seven Worlds, One Planet, both of which have an impact on the national and cultural conversation. Popular and impactful content also represents good value for money.</p> <p>2. In a seemingly contradictory move, we don鈥檛 always optimise for peak popularity. We make and promote content that aims to appeal to different audiences, groups, communities, regions and perspectives. We want to showcase content that feels personal. Take the podcast Netballers, which is about a sport with less national impact than, say, Premier League football. In terms of feeling relevant, it鈥檚 a win for women, young and BAME audiences. And it reflects an aspect of British culture. By its nature, it doesn鈥檛 get as many downloads as the Peter Crouch podcast, but it serves to inform and entertain those with a passion for netball.</p> <p>3. We strive always to bring audiences something new - newly released music, new writers, new presenters, untold stories, news about events and stories of national significance, information about emerging technologies, research, discoveries, perspectives, major drama series, ways of storytelling. This quest for the new is the kindling that starts the national and cultural conversation in the first place. The 麻豆约拍 produces a lot of this type of content, but it is not easily identified by engines which are based on collaborative filtering alone*.</p> <p>4. We provide useful, helpful, practical information, explainers and fact checks, which makes us a trusted source of information: that could be a news story, or revision notes for a Chemistry GCSE, or recipes for a quick mid-week meal, or sports results.</p> <p>5. We have enormous breadth and depth which means that there is something for everyone: there is variety by topic, tone, format, duration, location, level of expertise and age suitability and relevance.</p> <p>Our long term ambition is to use this thinking to build recommendation systems which can broaden our audience鈥檚 horizons, providing different perspectives and stories and experiences that they might not otherwise have come across. But this is complex - and a recommender that can effectively deal with as multi-faceted an editorial issue as impartiality is extremely challenging and will take significant time to develop.</p> <p>For now, curation in the context of our recommenders involves upholding our editorial values and finding ways to surface the most relevant and compelling content for each user. All the content the 麻豆约拍 makes for our UK audiences, one way or another, is public service, so our recommendations will of course always have a public service flavour.</p> <p>I鈥檓 excited that we have already built models with business rules about increasing breadth (in the Sounds recommender) and depth (in the World Service recommenders), and as well as reflecting editorial values around sensitivity. For example, on the Sport recommender, due for release next year, we鈥檝e taken the curatorial decision that content from rival teams will not be shown together in a set of recommendations.</p> <p>We can learn so much about public service by sharing the editorial point of view as we iterate and refine our approach. I鈥檒l be working in collaboration with editorial colleagues and data scientists to ensure these public service curation criteria inform the 麻豆约拍鈥檚 future recommendations engines.</p> <p>Lastly, we also recognise that the power of machine learning - of which recommendations is a part - can only get us so far. Machines cannot understand all the subtleties, complexities and nuance of editorial decision making. An algorithm will have trouble identifying what is entertaining or fresh or authentic without significant human assistance. A machine can help only up to a point to accurately tag content in a metadata system before a human verifies the machine鈥檚 choices and hits 鈥榩ublish鈥.</p> <p>So at the 麻豆约拍, we鈥檒l be maintaining a human hand in content creation and discovery. We need both humans and machines to best serve our audiences; editorial colleagues are highly skilled at making and promoting our content and machines can help amplify those skills. More and more, what recommendations can do is help locate the needle in the haystack in the first place - and exactly the right needle for you.</p> <p>*Which is why we have built a factorisation machine for our Sounds recommender which combines collaborative filtering with content matching. For more detail see <a href="/blogs/internet/entries/3e4342d4-6f81-47c0-8ba2-8dc7b419eb72">Developing personalised recommendation systems at the 麻豆约拍</a>.聽</p> </div> <![CDATA[Scaling responsible machine learning at the 麻豆约拍]]> 2019-10-04T09:32:44+00:00 2019-10-04T09:32:44+00:00 /blogs/internet/entries/4a31d36d-fd0c-4401-b464-d249376aafd1 Gabriel Straub <div class="component prose"> <p>Machine learning is a set of techniques where computers can 鈥榮olve鈥 problems without being explicitly programmed with all the steps to solve the problem, within the parameters set and controlled by data scientists working in partnership with editorial colleagues.</p> <p>The 麻豆约拍 currently uses machine learning in a range of ways 鈥 for example to provide users with personalised content recommendations, to help it understand what is in its vast archive, and to help transcribe the many hours of content we produce. And in the future, we expect that machine learning will become an ever more important tool to help the 麻豆约拍 create great audience experiences.</p> <p>The 麻豆约拍 was founded in 1922 in order to inform, educate and entertain the public. And we take that purpose very seriously. We are governed by our <a href="https://www.bbc.com/aboutthebbc/governance/charter">Royal Charter</a>聽and public service is at the heart of everything we do. This means that we act on behalf of our audience by giving them agency and that our organisation exists in order to serve individuals and society as a whole rather than a small set of stakeholders.</p> <p>With Machine Learning becoming a more prevalent aspect of everyday life, our commitment to audience agency is reflected in this area as well. And so in 2017, we submitted <a href="http://data.parliament.uk/writtenevidence/committeeevidence.svc/evidencedocument/artificial-intelligence-committee/artificial-intelligence/written/70493.pdf%20">a written commitment to the House of Lords Select Committee on Artificial Intelligence</a>聽in which we promised to be leading the way in terms of responsible use of all AI technologies, including machine learning.</p> <p>But what does this mean in practice?</p> <p>For the last couple of months, we have been bringing together colleagues from editorial, operational privacy, policy, research and development, legal and data science teams in order to discuss what guidance and governance is necessary to ensure our machine learning work is in line with that commitment.</p> <p>Together, we agreed that the 麻豆约拍鈥檚 machine learning engines will support public service outcomes (i.e. to inform, educate and entertain) and empower our audiences.</p> <p>This statement then led to a set of <strong>麻豆约拍 Machine Learning Principles</strong>:</p> <h4>The 麻豆约拍鈥檚 Values</h4> <p>1. The 麻豆约拍鈥檚 ML engines will reflect the values of our organisation; upholding trust, putting audiences at the heart of everything we do, celebrating diversity, delivering quality and value for money and boosting creativity.</p> <h4>Our Audiences</h4> <p>2. Our audiences create the data which fuels some of the 麻豆约拍鈥檚 ML engines, alongside 麻豆约拍 data. We hold audience-created data on their behalf, and use it to improve their experiences with the 麻豆约拍.</p> <p>3. Audiences have a right to know what we are doing with their data. We will explain, in plain English, what data we collect and how this is being used, for example in personalisation and recommendations.</p> <h4>Responsible Development of Technology</h4> <p>4. The 麻豆约拍 takes full responsibility for the functioning of our ML engines (in house and third party). Through regular documentation, monitoring and review, we will ensure that data is handled securely. And that our algorithms serve our audiences equally and fairly, so that the full breadth of the 麻豆约拍 is available to everyone.</p> <p>5. Where ML engines surface content, outcomes are compliant with the 麻豆约拍鈥檚 editorial values (and where relevant as set out in our editorial guidelines). We will also seek to broaden, rather than narrow, our audience鈥檚 horizons.</p> <p>6. ML is an evolving set of technologies, where the 麻豆约拍 continues to innovate and experiment. Algorithms form only part of the content discovery process for our audiences, and sit alongside (human) editorial curation.</p> <p>These principles are supported by a checklist that gives practitioners concrete questions to ask themselves throughout a machine learning project. These questions are not formulated as a governance framework that needs to be ticked off, but instead aim to help teams building machine learning engines to really think about the consequences of their work. Teams can reflect on the purpose of their algorithms; the sources of their data; our editorial values; how they trained and tested the model; how the models will be monitored throughout their lifecycle and their approaches to security and privacy and other legals questions.</p> <p>While we expect our six principles to remain pretty consistent, the checklist will have to evolve as the 麻豆约拍 develops its machine learning capabilities over time.</p> <p>The <a href="https://findouthow.datalab.rocks/">Datalab team</a>聽is currently testing this approach as they build the 麻豆约拍鈥檚 first in-house recommender systems, which will offer a more personalised experience for 麻豆约拍 Sport and 麻豆约拍 Sounds. We also hope to improve the recommendations for other products and content areas in the future. We know that this framework will only be impactful if it is easy to use and can fit into the workflows of the teams building machine learning products.</p> <p>The 麻豆约拍 believes there are huge benefits to being transparent about how we鈥檙e using Machine Learning technologies. We want to communicate to our audiences how we鈥檙e using their data and why. We want to demystify machine learning. And we want to lead the way on a responsible approach. These factors are not only essential in building quality ML systems, but also in retaining the trust of our audiences.</p> <p>This is only the beginning. As a public service, we are ultimately accountable to the public and so are keen to hear what you think of the above.</p> </div> <![CDATA[Navigating the data ecosystem technology landscape]]> 2019-09-03T12:46:36+00:00 2019-09-03T12:46:36+00:00 /blogs/internet/entries/67fee994-3d20-45d5-be2a-acfc47d572f1 Hannes Ricklefs, Max Leonard <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p07mb9b5.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p07mb9b5.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p07mb9b5.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p07mb9b5.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p07mb9b5.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p07mb9b5.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p07mb9b5.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p07mb9b5.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p07mb9b5.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""><p><em>Credit: Jasmine Cox</em></p></div> <div class="component prose"> <p>Want to message your Facebook friends on Twitter? Move your purchased music from iTunes to Amazon? Get Netflix recommendations based on your iPlayer history? Well, currently you can鈥檛.</p> <p>Many organisations are built on data, but the vast majority of the leading players in this market are structured as vertically integrated walled gardens, with few (if any) meaningful interfaces to any outside services. There are a great number of reasons for this, but regardless of whether they are intentional or technological happenstance (or a mixture of both), there is a rapidly growing movement of GDPR supercharged technologists who are putting forward <a href="/news/technology-45706429">decentralised and open alternatives</a> to the <a href="https://www.intricity.com/data-science/what-is-a-data-moat/">data-moated</a> household names of today. For the 麻豆约拍 in particular, these new ways of approaching data are well aligned with our public service ethos and commitment to treating data in the most ethical way possible.</p> <p>Refining how the 麻豆约拍 uses data, both personal and public, is critical if we are to create <a href="/mediacentre/speeches/2017/tony-hall-annual-plan#heading-a-personalised-uniquely-tailored-bbc">a truly personalised 麻豆约拍</a> in the near term and essential if we want to remain relevant in the coming decades. Our Chief Technology and Product Officer Matthew Postgate <a href="/blogs/internet/entries/78948980-e1e6-48fe-918a-c9bb5f2a0719">recently spoke about the 麻豆约拍鈥檚 role within data-led services</a>, in which he outlined some of the work we have been doing in this respect to ensure the 麻豆约拍 and other public service organisations are not absent from new and emerging data economies.</p> <p>Alongside focused technical research projects like the <a href="/rd/blog/2019-06-bbc-box-personal-data-privacy">麻豆约拍 Box</a>, we have been mapping the emerging players, technologies and data ecosystems to further inform the 麻豆约拍鈥檚 potential role in this emerging landscape. Our view is that such an ecosystem is made up of the following core capabilities: Identity, data management (storage, access, and processing), data semantics and the developer experience, which are currently handled wholesale in traditional vertical services. A first step for us is hence to ascertain which of these core capabilities can realistically be deployed in a federated, decentralised future, and which implementations currently exist to practically facilitate this.</p> <p><strong>Identity</strong>, a crucial component of the data ecosystem, proves who users say they are providing a true digital identity. Furthermore we expect standard account features such as authentication and sharing options via unique access token that could enable users to get insights or to share data to be part of any offering. We found that identity, in the context of proving a user鈥檚 identity, was not provided by any of the solutions we investigated. Standard account features were present, ranging from platform specific implementations, to decentralised identifier approaches via WebID, and blockchain based distributed ledger approaches. As we strongly believe it is important to prove a user is who they say they are, at this point we would look to integrate solutions that specialise in this domain.</p> <p><strong>Data management</strong> can be further broken down into 3 areas:</p> <ol> <li><strong>Data usage and access</strong>, involves providing integration of data sources with an associated permission and authorisation model. Users should have complete governance of their data and usage by data services. Strong data security controls and progressive disclosure of data are key here. Given our investigation is based around personal data stores (PDS) and time series sensor/IoT device data platforms to capture personal, public and open data, providing access and controls around sharing of data was a fundamental capability of all offerings. All of them provided significant granularity and transparency to the users about what data is being stored, its source and usage by external services.</li> <li><strong>Data storage</strong> must provide high protection guarantees of users鈥 data, encrypted in transit and at rest, giving users complete control and transparency of data lifecycle management. Again, this is a fundamental requirement, such that storage is either a core offering of any platform or outsourced to external services that store data in strongly encrypted formats.</li> <li><strong>Data processing</strong> mechanisms to allow users to bring 鈥渁lgorithms鈥 to their data, combined with a strong contract based exchange of data. Users are in control and understand what insights algorithms and services derive from their data. These might include aspects such as the creation of reports, creation and execution of machine learning models, other capabilities that reinforce the user鈥檚 control over how their personal data is used for generated insights. Through contract and authorisation based approaches users have complete audit trails of any processing performed which provides transparency of how data is utilised by services, whilst continuously being able to detect suspicious or unauthorised data access. Our investigations found that processing of data is either through providing SDKs that heavily specify the workflow for data processing, or no provisioning at all, leaving it to developers to create their own solution.</li> </ol> <p><strong>Data model and semantics聽</strong>refers to mechanisms that describe (schemas, ontologies) and maintain the data domains inside of the ecosystem, which is essential to provide extensibility and interoperability. Our investigations found this being approached in a wide spectrum from:</p> <ol> <li>no provision requiring developers to come to conclusions about the best way to proceed</li> <li>using open standards such as schema.org and modeling data around linked data and RDF</li> <li>completely proprietary definitions around schemas within the system.</li> </ol> <p>Finally the <strong>developer experience</strong> is key. It requires a set of software development tools to enable engineers to develop features and experiences as well as being able to implement unique value propositions required by services. This is the strongest and most consistent area across all our findings.</p> <p>In summary our investigations have shown that there is no one solution that provides all of our identified and required capabilities. Crucially the majority of the explored end user solutions are still commercially orientated, such that they either make money from subscribers or through associated services.</p> <p>So with the number of start-ups, software projects and standards that meet these capabilities snowballing, where might the 麻豆约拍 fit into this increasingly crowded new world?</p> <p>We believe that the 麻豆约拍 has a role to play in all of these capabilities and that it would enhance our existing public service offering: to inform, educate and entertain. A healthy ecosystem requires multiple tenants and solutions providers, all adhering to core values such as transparency, interoperability and extensibility. Only then will users be able to freely and independently move or share their data between providers which would enable purposeful collaboration and fair competition toward delivering value to audiences, society and industry.</p> <p>The 麻豆约拍 was incorporated at the dawn of the radio era to counteract the unbridled free-for-all that often comes with any disruptive technology, and <a href="/rd/about/our-purpose">its remit to shape standards and practices </a>for the good of the UK and its population stands today as <a href="http://downloads.bbc.co.uk/historyofthebbc/1920s.pdf">it did in 1927</a>. With a scale, reach and purpose that is unique to the 麻豆约拍, it is strongly congruent with our public service duty to help drive policy, standards and access rights to ensure that the riches on offer in these new ecosystems are not coopted solely for the downward pursuit of profit, and remain accessible for the benefit of all.</p> </div> <![CDATA[Machine learning and editorial collaboration within the 麻豆约拍]]> 2019-08-29T13:55:00+00:00 2019-08-29T13:55:00+00:00 /blogs/internet/entries/a38207dd-e4ed-40fa-8bdf-aebe1dc74c28 Anna McGovern, Ewan Nicolson, Svetlana Videnova <div class="component prose"> <p>The 麻豆约拍 is nearly 100 years old. Inevitably, as an organisation we are having to adapt to meet some of the technological requirements of the future, such as incorporating Machine Learning (ML) technologies. ML recommendations, for example, is a standard way for audiences to discover content and the 麻豆约拍 is committed to make this discovery more personal. Developing these services has brought an interesting opportunity for collaboration between the ML and Editorial teams within Datalab, the 麻豆约拍 team focused on building recommendation engines.</p> <p>About a year ago we started the experiment of the <a href="/blogs/internet/entries/82cd8d1e-2f23-4eff-8f34-0ef38ca8854c">麻豆约拍+ app</a>. This was the first time the 麻豆约拍 provided the audience with a fully automated ML service. With this wealth of knowledge and with more data science initiatives taking shape, we want to use all the available expertise the 麻豆约拍 can provide.</p> <p>Our aim is to create responsible recommendation engines, true to the 麻豆约拍 values and using all available expertise the 麻豆约拍 can provide. In industry, it is commonplace for data science teams to make use of specialist knowledge to inform how models are developed. For example, data scientists working for a travel site would use experts with knowledge about everything from business flights to how and when families go on holiday. Datalab consulted editorial teams and representatives who specialised in curation as it began to develop recommendations for content discovery.</p> <p>Datalab鈥檚 editorial lead, Anna McGovern, helps us with advice on editorial judgement and content curation expertise within the 麻豆约拍. Ewan Nicolson is lead data scientist and represents the technological aspect of Datalab鈥檚 work here. Svetlana Videnova, Business Analyst, poses some of the common teamwork problems within the public media industry and technological challenges we face today. We will focus on a given challenge about the curation of the content and leave its creation phase for another post. Both Anna and Ewan will provide their way of tackling that work in their own fields. The last column of the table below demonstrates an example of how the collaboration works in our team.</p> <p>As you鈥檒l see, the two fields of editorial and data science compliment each other. Working across discipline gives better results for the audience, and helps us learn from each other. It means that machine learning is actually solving the correct problems because we鈥檙e making use of the rich expertise from editorial. It also means that editorial are able to take advantages of techniques like machine learning to multiply their efforts and deliver more value to the audience.</p> </div> <div class="component prose"> <table width="554" border="1" cellpadding="0"> <tbody> <tr> <td valign="top" width="98"> <p><strong>Challenge</strong></p> </td> <td valign="top" width="126"> <p><strong>Machine Learning solution</strong></p> </td> <td valign="top" width="147"> <p><strong>Editorial solution</strong></p> </td> <td valign="top" width="175"> <p><strong>When we collaborate</strong></p> </td> </tr> <tr> <td valign="top" width="98"> <p>How聽do聽we ensure curation is a good experience for users?</p> </td> <td valign="top" width="126"> <p>We consider many different measures of success: accuracy, diversity, recency, impartiality, editorial priority.</p> </td> <td valign="top" width="147"> <p>Traditionally on an editorial team, a journalist would research a story, discuss how it might be covered and compose the story itself to make it compelling.聽</p> </td> <td valign="top" width="175"> <p>The data scientists get a rich understanding from editorial of the different trade-offs between these measures of success. Deep domain knowledge.</p> </td> </tr> <tr> <td valign="top" width="98"> <p>How does recency impact curation of content?</p> </td> <td valign="top" width="126"> <p>We include publication date as a feature in our models. We sometimes try and optimise for recency, showing people more current content in some situations.</p> </td> <td valign="top" width="147"> <p>One of the challenges is that once that work is done it is fairly hard to bring the editorial creation back to life, especially for evergreen content. This is one of many examples that ML recommendations could help聽with, by surfacing this content in the most relevant time according to the user鈥檚 experience or history.聽</p> </td> <td valign="top" width="175"> <p>By working together we鈥檙e able to identify how to make decisions about which pieces of content are evergreen and suitable for recommendation, and which pieces have a limited shelf-life and shouldn鈥檛 be presented to users beyond a certain point.</p> </td> </tr> <tr> <td valign="top" width="98"> <p>How does the 麻豆约拍 ensure impartiality?聽</p> </td> <td valign="top" width="126"> <p>We use聽<a href="https://5harad.com/papers/fair-ml.pdf">measures of statistical fairness</a>聽to understand if our model is giving unbiased results.</p> <p>聽</p> <p>Good practice in machine learning make sure that we鈥檙e using unbiased training data.</p> <div align="center"><hr align="center" size="1" width="69" /></div> </td> <td valign="top" width="147"> <p>Editors,聽journalists and content creators make a concerted effort聽to ensure that a range of views and perspectives are shown within a piece of content or across several pieces of content(within聽a series for example)</p> </td> <td valign="top" width="175"> <p>We combine our good practices with domain knowledge from editorial. We use techniques like聽human-in-the-loop machine learning, or semi-supervised learning聽to make editorial鈥檚 lives easier, and apply their knowledge at massive scale.</p> <p>聽</p> <p>ML helps editorial聽identifying those pieces of content that show a breadth of views.聽</p> </td> </tr> <tr> <td valign="top" width="98"> <p>How we ensure聽variety聽within content serving?</p> </td> <td valign="top" width="126"> <p>We construct mathematical measures for聽<a href="http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.221.6277&rep=rep1&type=pdf">novelty and diversity.</a>聽We include these in our machine learning optimisations.</p> <p>聽</p> </td> <td valign="top" width="147"> <p>Editorial staff responsible for curation聽ensure聽a breadth and depth of content on indexes, within collections etc</p> </td> <td valign="top" width="175"> <p>We learn about the differences between our different pieces of content. Working together we鈥檙e able to determine if our recommendations offer an interesting, relevant, and useful journey for the user.聽</p> <p>聽</p> <p>The 麻豆约拍鈥檚 audio networks feature different output and tone of voice. ie.聽Radio 4 has a very different 鈥榝lavour鈥櫬爐o 6Music. Consequently network can be used to ensure variety in results.</p> </td> </tr> <tr> <td valign="top" width="98"> <p>How do we avoid聽legal issues?聽</p> </td> <td valign="top" width="126"> <p>We are given a checklist, and we check the items off. We get told that there are things聽鈥渨e can鈥檛 do for opaque legal reasons鈥 but never really understand why, and limit the functionality of our solution.</p> <p>聽</p> </td> <td valign="top" width="147"> <p>Editors, journalists and content creators have to attend a mandatory course relating to media law, so that they have full knowledge about issues such as contempt of court, defamation and privacy. An editor will sign off content to ensure that content is compliant with legal requirements.聽</p> </td> <td valign="top" width="175"> <p>By talking to legal advisers we can build business rules to minimise the risk of legal infractions.聽</p> <p>聽</p> <p>Close collaboration with editorial means we gain a deep understanding of the potential problems ahead at an early stage. We build with awareness of these concerns, and with that awareness build a solution that is high quality from both a technical and editorial point of view.</p> </td> </tr> <tr> <td valign="top" width="98"> <p>How we聽handle editorial quality?</p> </td> <td valign="top" width="126"> <p>We build and refine a model using data science good practices, and then turn it over to our editorial colleagues. They then decide if the results are good or not.</p> </td> <td valign="top" width="147"> <p>When editors curate they can choose content that is relevant, interesting and of good quality.聽</p> <p>聽</p> <p>聽</p> <p>Recommendations present a specific editorial challenge,聽in that recommenders can surface content that is not the best of our output.聽</p> </td> <td valign="top" width="175"> <p>In聽麻豆约拍+聽we prioritised content that we knew would suit the environment in which it appeared: standalone, short-form videos, appearing in a feed, from digital first areas such as Radio 1, The Social, 麻豆约拍 Ideas etc</p> <p>聽</p> <p>Including editorial throughout the process means that they teach us about what is important in the results, so that data science understand the real problems that we鈥檙e trying to solve.</p> <p>聽</p> <p>We fail quickly, and learn quickly, getting to a better quality result.</p> </td> </tr> <tr> <td valign="top" width="98"> <p>How we learn from our audiences? Accuracy/user generated content?</p> </td> <td valign="top" width="126"> <p>Measure user activity with the products, and construct measurements of engagement.</p> <p>聽</p> <p>Building implicit and explicit feedback loops. An explicit feedback loop is having a鈥渓ike鈥澛燽utton, an implicit feedback loop is determining a way to measure when something has gone wrong, like bounce rate or user churn.</p> <p>聽</p> </td> <td valign="top" width="147"> <p>We monitor feedback and analyse stats to build a picture about how our audiences engage with our content.聽</p> </td> <td valign="top" width="175"> <p>We work with editorial to understand the insights we get from data. They help rationalise the behaviours that we see in the data. They also teach us things that we should look for in the data.</p> </td> </tr> <tr> <td valign="top" width="98"> <p>How we test recommendations</p> </td> <td valign="top" width="126"> <p>A mixture of offline evaluation metrics聽(e.g.testing against a known test set of data), and online evaluation metrics聽(e.g.A/B testing)</p> <p>聽</p> </td> <td valign="top" width="147"> <p>Traditionally: We monitor feedback and analyse stats to build a picture about how our audiences engage with our content.聽</p> </td> <td valign="top" width="175"> <p>The editorial lead works with data scientists on the composition of the recommender. The results are then reviewed by the editorial lead and to obtain a variety of opinions the results are reviewed by more editorial colleagues.聽</p> <p>聽</p> <p>More on quantitative testing聽<a href="/blogs/internet/entries/3e4342d4-6f81-47c0-8ba2-8dc7b419eb72">here</a><strong>聽.</strong></p> <p>聽</p> <p>The rich editorial feedback lets us understand where our model could be better and make improvements.</p> </td> </tr> </tbody> </table> </div> <div class="component prose"> <p>We鈥檙e big believers in cross-disciplinary collaboration. As we鈥檝e touched on in this article the 麻豆约拍 has a lot of uniquely complex problems to solve in this space. This collaboration is essential if we鈥檙e going to continue to deliver value to the 麻豆约拍鈥檚 audience using data.</p> <p>If you are curious about this collaboration and would like to know more in depth about how we work, leave us a message and we will be happy to get back to you.</p> <p>Also, we are hiring <a href="https://findouthow.datalab.rocks/">https://findouthow.datalab.rocks/</a>.</p> </div> <![CDATA[Developing personalised recommender systems at the 麻豆约拍]]> 2019-08-22T14:20:43+00:00 2019-08-22T14:20:43+00:00 /blogs/internet/entries/3e4342d4-6f81-47c0-8ba2-8dc7b419eb72 Jana Eggink <div class="component prose"> <p>The 麻豆约拍 is on a journey to become more personalised, and recommendations are an important part of that goal. To date, recommendations in the 麻豆约拍 have been provided primarily by external providers. We feel that offering 鈥 and understanding 鈥 good recommendations is a crucial area for us in reaching our target audience of young listeners and so we have started exploring this area in-house. The <a href="https://findouthow.datalab.rocks">Datalab team</a> is a relatively new team specialising in machine learning, and looking after recommender systems in the 麻豆约拍. We work with product groups to develop new ways to personalise their offerings, and also collaborate with <a href="/rd">麻豆约拍 R&D</a>.</p> <p>We want to be able to explain the composition of our recommendations and so we need to understand how they are generated. Our recommendations should reflect the breadth and diversity of our content and meet our editorial guidelines, as well as informing, educating and entertaining! All these were good reasons for us to build the capability to constantly create challengers to the existing recommendation models.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p07l9x41.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p07l9x41.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p07l9x41.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p07l9x41.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p07l9x41.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p07l9x41.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p07l9x41.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p07l9x41.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p07l9x41.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""><p><em>Current recommendations on Sounds website</em></p></div> <div class="component prose"> <p>Datalab was assigned this brilliant and fun challenge and began collaborating with the Sounds team, using a multidisciplinary group made up of data scientists, engineers, editorial specialists and product managers.</p> <p>The team had some prior experience building personalised recommendations for our video clip <a href="/blogs/internet/entries/82cd8d1e-2f23-4eff-8f34-0ef38ca8854c">app 麻豆约拍+</a>. For 麻豆约拍+, the recommender was purely content based, using existing metadata information such as genres (e.g. Drama/Medical) or brands (e.g. Glastonbury Festival). This would probably have been a good approach if our content had been labelled for the express purpose of personalisation. However, the 麻豆约拍鈥檚 production workflows were designed to meet the needs of broadcast systems, and we didn鈥檛 always have all the labels we would have wanted for recommendations.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p07l9xqd.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p07l9xqd.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p07l9xqd.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p07l9xqd.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p07l9xqd.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p07l9xqd.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p07l9xqd.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p07l9xqd.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p07l9xqd.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""><p><em>麻豆约拍+ app</em></p></div> <div class="component prose"> <p><a href="https://www.csie.ntu.edu.tw/~b97053/paper/Rendle2010FM.pdf">Factorisation Machines</a> come with the enticing promise of combining content-based recommendations with collaborative filtering.</p> <p>Using a standard content-based approach, if a user had listened to podcasts from the genre 鈥楬ealth & Wellbeing鈥 the system would recommend a new episode from Radio 1鈥檚 Life Hacks but it could also recommend Radio 4鈥檚 Inside Health, which has a very different tone of voice. By contrast, collaborative filtering matches programmes based on what similar users have enjoyed 鈥 so if they listen to Radio 1鈥檚 Life Hacks, they might be recommended Radio 1 comedy. This model relies on 鈥榓djacent鈥 content similar to the recommendations found in shopping websites where 鈥榗ustomers who bought this also bought that鈥. This approach often leads to better recommendations for established content, but is less effective for fresh content that hasn鈥檛 been consumed by significant numbers of people. Since the 麻豆约拍 continuously produces new content throughout the day this recommendation strategy by itself would be limiting.</p> <p>Factorisation machines are a smart way to combine both. They have been around a few years, and open source toolboxes exist to support them. Our team programs primarily in Python, so we wanted a toolbox that integrates with that. Obviously, we also wanted it to be fast, give superior results and be easy to use (more on that later鈥).</p> <p>We stored user-item interactions (i.e. the programmes a specific user has listened to) in a BigQuery table. The programme items with the corresponding genre and brand metadata were in a different table, and both needed to be assembled in the correct format for the factorisation machines. Our first choice of toolbox was xlearn. The code seemed relatively mature, running a first test example was easy, and the toolbox offers a variety of different flavours in terms of learning algorithm. But it was hard to get the data into the correct format and, even now that we have a version up and running, we鈥檙e still not sure we got everything right 鈥 mainly because the initial results are nowhere near as good as we had wanted (and expected) them to be!</p> <p>The quality of recommendations can be subjective and we needed a way to test them before making them live anywhere on the 麻豆约拍鈥檚 websites or apps. Predicting past behaviour is one way of doing this, but also comes with all sorts of problems: users only click on what they see, a piece of content might be brilliant, but if it does not appear in the results, the user will not see it and cannot click on it. Recommending the most popular items generally gives good numbers (as by definition these items get the most clicks), but rarely leads to recommendations of fresh content. In practical terms, it鈥檚 also a lot of work to set up if your data is stored in ways that were not devised with the ease of access for data scientists in mind鈥</p> <p>So we decided to test the results using qualitative evaluation, asking about 20 editorial and other non-technical people to judge our new recommendations against those from an existing provider. We didn鈥檛 tell them which set came from which recommender! We used the individual history of the internal test participants to generate the recommendations by both providers and asked for their preference and general feedback.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p07lbl35.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p07lbl35.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p07lbl35.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p07lbl35.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p07lbl35.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p07lbl35.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p07lbl35.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p07lbl35.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p07lbl35.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""><p><em>Qualitative experiment</em></p></div> <div class="component prose"> <p>Most of our test users preferred the recommendations we currently have live to our first set of test recommendations and we weren鈥檛 keen on them either, so we knew we had more work to do.</p> <p>With the overall infrastructure set-up, it was quite easy to swap out the toolbox we鈥檝e used for the factorisation machines. We had previously looked at <a href="https://lyst.github.io/lightfm/docs/home.html">lightFM,</a> and it had a much simpler data format, so we decided to give it a go. We were able to compute new recommendations and run another qualitative experiment in less than two weeks. Our recommendations looked much better, and our test users agreed! However, these are still first results. We don鈥檛 feel we鈥檝e fully solved the problem of recommending popular items versus programmes that are strongly tailored towards a specific user鈥檚 interests, and are looking into ways to improve this.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p07lbmwq.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p07lbmwq.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p07lbmwq.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p07lbmwq.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p07lbmwq.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p07lbmwq.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p07lbmwq.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p07lbmwq.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p07lbmwq.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p>We are happy with the results so far, but there is still a lot of work to do to bring the recommender into production. The infrastructure needs decidedly more work to make it robust and able to scale, and we鈥檇 like to do more testing. Having a variety of offline metrics should help us to optimise parameters, and test new algorithms without having to go back to our testing panels every few days. We鈥檙e also still looking at a simple content-based recommender to have another baseline, so more results hopefully soon.</p> <p>We also still have some more fundamental questions that we hope our practical work will help us to answer. For example, can we use the same approach for recommending entertainment as for news, or do we need specialised systems for each domain? And what if we change the medium and move from audio and video to text, or new interfaces like voice controlled devices? Even if the overall editorial guidelines do not change, we might need different technical approaches to be able to achieve them. But we also want to avoid starting from scratch for every new recommender we build, and we鈥檙e still trying to figure out how best to do that. In summary, there is lots to do, but it鈥檚 exciting and we鈥檙e enjoying the challenge!</p> <p>Want to work with us? <a href="https://findouthow.datalab.rocks/">https://findouthow.datalab.rocks/</a></p> </div> <![CDATA[Personalisation: Is there a price for convenience?]]> 2018-10-19T12:52:34+00:00 2018-10-19T12:52:34+00:00 /blogs/internet/entries/b79885ce-811e-41c0-9e67-0576fe4f5dfc Sinead O'Brien <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p06p8wg8.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p06p8wg8.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p06p8wg8.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p06p8wg8.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p06p8wg8.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p06p8wg8.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p06p8wg8.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p06p8wg8.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p06p8wg8.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p><em>Sinead O'Brien, Technology Strategy & Architecture's Lead Project Manager for Transformation Delivery shares her insights from this month's 麻豆约拍 Machine Learning Fireside Chat.</em></p> <p>As more and more of our intimate data is collected, is there a price of convenience? If so, what is it, and is it worth paying? The decidedly thought-provoking discussion at last week鈥檚 sold out '麻豆约拍 Machine Learning Fireside Chats presents: The Price of Convenience鈥 was hosted by Ahmed Razek of 麻豆约拍 Blue Room.</p> <p><strong>The provocation鈥</strong><br />There is an increasingly fine line between personalised services and invasive services. Do people understand that they鈥檙e trading their personal data for these services? Are they aware of the risks? Do they care?</p> <p><strong>On the panel鈥</strong><br />Maxine Mackintosh, PhD student at <a href="https://www.turing.ac.uk/">The Alan Turing Institute</a>. Maxine鈥檚 PhD involves mining medical records for new predictors of dementia. She is passionate about understanding how we might make better use of routinely collected data to improve our cognitive health.</p> <p>Also on the stellar line-up was Josh Cowls, Research Associate in Data Ethics at The Alan Turing Institute, and a doctoral researcher at the Digital Ethics Lab, Oxford Internet Institute. Josh's research agenda centres on decision-making in the digital era, with a particular focus on the social and ethical impact of big data and AI and its intersection with public opinion and policy-making.</p> <p>The third guest speaker for the evening, Martin Goodson, is Chief Scientist and CEO of <a href="https://evolution.ai/">Evolution AI</a>. Martin is a specialist in natural language processing, the computational understanding of human language.</p> <p><strong>The discourse鈥</strong></p> <p>Maxine kicked off the conversation with a rather hard-hitting statement, that we misunderstand what 鈥渉ealth data鈥 really means. When we discuss health data, it is presumed that we refer to data that is collected when we interact with the health system 鈥 our medical records. Incorrect. That is 鈥渟ickness data鈥. Health data refers to search data, the information captured when we Google, for example travel, which indicates how healthy we are.</p> <p>Maxine is a member of <a href="https://deepmind.com/applied/deepmind-health/">DeepMind Health's</a> independent review panel. The board looks to build trust through radical transparency. She argued that we cannot expect the NHS or academia to afford the computational power required to get things right. Therefore we have to work together alongside the large corporations. Corporates can play an innovating role but they, and not just DeepMind, should not be enabled to profit from our data. We, the citizens, own the data. The government has a regulator role to play in protecting society.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p06p8yd2.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p06p8yd2.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p06p8yd2.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p06p8yd2.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p06p8yd2.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p06p8yd2.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p06p8yd2.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p06p8yd2.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p06p8yd2.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p>Martin spoke further to the tensions between privacy and innovation. If we are too private with our data, there will be less innovation. He argued that the privileged of society are more likely to benefit from AI in terms of convenience. Data needs to work for people and for society. Misuses of machine learning based systems that have led to cruel justice were pointed to as an exemplar of the negative impact of the less privileged of society.</p> <p>The panel then moved on to the topics of ethics. There was a sudden interest in the ethical perspective. Ahmed asked if there is a risk of 鈥渆thics-washing鈥, using ethical defence to side-step issues such as privacy, autonomy, and agency? General consensus amongst the panel was that the UK is in a good place to be setting the agenda. Europe has a long tradition of setting human liberties. But we need to be ethical and enable innovation at the same time.</p> <p>The panel argued that unless citizens are personally affected by data breaches, they don鈥檛 really understand the repercussions. The public perspective is as much about when and how you ask, as whom you ask. We don鈥檛 need to teach kids to code. We need to teach young people to think about how coding impacts and why the control of data may be important.</p> <p>Maxine highlighted that NHS users are automatically opted in to their depersonalised confidential patient information being used for research and planning by the NHS, as well as commercial and academic partners. NHS data isn鈥檛 great but it does have scale. There are huge benefits for populational research. Health data was likened to taxes, a societal contract. Informed decision-making is important. I am happy to share my data in this scenario. Would you opt out of giving your health data?</p> <p>The discussion closed with a last thought-provoking question: "Can we put data solely in the hands of non-profits?". The panel argued that our health and justice systems need to be able to engage with organisations commercially. And sufficient profit is needed to run these organisations. The panel concluded that we need to define what we mean by 鈥渞easonable profit鈥 in this sense.</p> <p>For more details about upcoming events, visit <a href="https://www.meetup.com/Machine-learning-Fireside-Talks/?_cookie-check=t4snq-dMVnlSaLvP">麻豆约拍 Machine Learning Fireside Chat</a>.</p> </div> <![CDATA[Privacy by design]]> 2018-06-13T14:18:00+00:00 2018-06-13T14:18:00+00:00 /blogs/internet/entries/3e35ce9a-a8ac-49aa-9925-741c30738184 Adam Bailin <div class="component prose"> <p>This little button means a lot to us.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p067zctr.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p067zctr.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p067zctr.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p067zctr.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p067zctr.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p067zctr.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p067zctr.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p067zctr.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p067zctr.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p>When it鈥檚 switched on it allows you to view personalised content recommendations based on your historic 麻豆约拍 activity data.</p> <p>I work for the 麻豆约拍 Analytics Services team in Cardiff and we鈥檝e been working on what happens when <a href="https://account.bbc.com/account/settings/privacy">you switch it off</a>.</p> <p>The 麻豆约拍 decided that when you switch off personalisation, it shouldn鈥檛 link any activity data to your account. In fact, we think that it shouldn鈥檛 ever be possible to tie activity data back to you. This page <a href="http://www.bbc.co.uk/usingthebbc/account/about-your-personalisation-settings/">about using the 麻豆约拍</a> expresses it well:</p> <p><em>鈥淒ata about how you use the 麻豆约拍 will be anonymous. For instance, we鈥檇 be able to see that someone looked at a particular story on 麻豆约拍 News, but we wouldn鈥檛 be able to tell if it was you.鈥 </em></p> <p>That鈥檚 actually quite a difficult software engineering problem to solve. The way most analytics tracking systems work is by using cookies or some other persistent identifier precisely to be able to tie together a user鈥檚 activity across multiple sessions. To get around that, we reset a user鈥檚 analytics identifier whenever they switch personalisation off.</p> <h4><strong>How it's normally done</strong></h4> <p>Normally, when you sign in or out of a service your analytics identifier will persist. This means that organisations can attribute data to you as an individual even when you鈥檙e signed out or have chosen not to receive a personalised experience.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p06b1ddq.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p06b1ddq.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p06b1ddq.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p06b1ddq.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p06b1ddq.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p06b1ddq.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p06b1ddq.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p06b1ddq.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p06b1ddq.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <h4><strong>How we've designed it</strong></h4> <p>We鈥檝e designed our analytics ID differently, with privacy in mind.聽We鈥檝e designed it so that when you sign in to your 麻豆约拍 account and disable personalisation, we can鈥檛 attribute activity back to you as an individual.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p06b1dg2.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p06b1dg2.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p06b1dg2.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p06b1dg2.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p06b1dg2.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p06b1dg2.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p06b1dg2.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p06b1dg2.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p06b1dg2.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p>We鈥檝e designed privacy into our analytics.</p> </div> <div class="component prose"> <h4>Putting users in control of their data</h4> <p>The General Data Protection Regulation, or GDPR for short, is one of the biggest changes to data privacy law in recent years. It is designed to put you in control of how your information is collected and used by organisations.</p> <p>This change to our analytics services is a small example of how we are designing privacy as a feature to put users in control of their data. It is part of a wider opportunity to enable much greater control over how your data is collected, what you share and with whom. And in turn to drive a more relevant and nuanced personalisation of the 麻豆约拍鈥檚 services.</p> <p>We see privacy not just as an exercise in legal compliance but as an opportunity to deliver greater value for users.</p> </div> <![CDATA[Your data matters]]> 2018-05-22T14:04:00+00:00 2018-05-22T14:04:00+00:00 /blogs/internet/entries/7c605523-8df3-4dcb-bf58-7c64aa0b59a5 Julie Foster <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p06831gj.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p06831gj.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p06831gj.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p06831gj.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p06831gj.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p06831gj.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p06831gj.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p06831gj.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p06831gj.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p>Last <a href="http://www.bbc.co.uk/blogs/aboutthebbc/entries/77bdafd0-20b3-414d-aa53-48786b194543">May</a>, we updated everyone on our plans to make the 麻豆约拍 more personalised and relevant to you. We can give you more of what you love when we understand you better, and also make sure that as a public service, we make something for everyone.</p> <p>We now have over 15 million people with 麻豆约拍 accounts using the 麻豆约拍鈥檚 websites and apps in the last month. What鈥檚 more, they are also using 麻豆约拍 websites and apps more than people who are not signed in. 64% of 麻豆约拍 account users visit 麻豆约拍 online more than 2 days per week, compared to 46% of all users. And when they are on 麻豆约拍 websites and apps, people with 麻豆约拍 account spend an additional hour per week than people not signed in.</p> <p>Your personal data is helping power this transformation. We can鈥檛 provide you with a meaningful personal or tailored experience without this information, but it is ultimately your data. And your data matters.</p> <p>The General Data Protection Regulation, or GDPR for short, is coming into enforcement in the next week. It makes sure that businesses clearly explain to you why they collect your personal data, and how they use it. It is an evolution of the Data Protection Act, and gives you new and important rights.</p> <p>As we鈥檝e said before, we鈥檝e built our new 麻豆约拍 account system with GDPR in mind, but we鈥檙e always reviewing our processes, technology and governance.</p> <p>We use your personal data for different reasons, and it鈥檚 important we are transparent to you why we collect and use this data. Our site <a href="http://www.bbc.co.uk/usingthebbc">鈥楿sing The 麻豆约拍鈥</a> spells out, in plain English, what we will (and importantly won鈥檛) do with your data. It also can help you exercise your GDPR rights, such as changing some of your details in Settings.</p> <h4>How have we prepared for GDPR?</h4> <p>For starters, you should not need to be a rocket scientist to know your rights. We鈥檝e updated our policies to make them even more transparent and clear.</p> <h4>What are my rights?</h4> <p>We鈥檝e created a new section in Using The 麻豆约拍 all about GDPR to help you understand what your rights are, how you can exercise them with the 麻豆约拍 and get help.<br />We鈥檝e also innovated and developed technology with data privacy at the heart of what we create. Below are a couple of examples of the kind of work we鈥檝e done to prepare for GDPR.</p> <h4>Privacy for children</h4> <p>We want to help you make sure that your child can only watch programmes, read comments and upload their creations in a space that is age appropriate and suitable to them. For this reason, we鈥檝e developed a way for parents or guardians to register their child which is simple and easy to do, but more importantly is safe and secure for the entire family. We want your children to get great experiences on 麻豆约拍 websites and apps, and play our part to help protect them.</p> <h4>Privacy by design</h4> <p>This little button means a lot to us.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p067w1hg.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p067w1hg.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p067w1hg.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p067w1hg.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p067w1hg.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p067w1hg.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p067w1hg.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p067w1hg.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p067w1hg.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p>We really want you to have a personalised experience, like picking up where you left off watching a show, getting recommendations on programmes you might like or getting notifications about your favourite football team. But you have the right to <a href="https://account.bbc.com/account/settings/privacy">turn off</a> these features if you don鈥檛 want them. Our analytic services team has worked hard to develop a technical solution that can do this easily, and ensure your privacy.</p> <h4>What鈥檚 next?</h4> <p>We have some fantastic events coming this summer, from the <a href="http://www.bbc.co.uk/mediacentre/latestnews/2018/biggest-weekend-broadcast">Biggest Weekend</a> to <a href="/sport/football/44103384">FIFA World Cup </a>聽and Wimbledon. Your data is helping us learn what you like, so we can make sure you get the best out of this summer, and improve our services for you in the future.</p> </div> <![CDATA[Building a more personal and local 麻豆约拍 News website]]> 2017-11-08T09:55:00+00:00 2017-11-08T09:55:00+00:00 /blogs/internet/entries/59d55e8b-06f3-481a-b2c9-65a8908c5b08 Karolina Iwaszko <div class="component prose"> <p>We are making some changes to the way we provide local news, sport and other location driven information, like travel or weather updates online. <a href="/news/localnews" target="_blank">Our new pages</a> offer a view of what鈥檚 relevant and important to a person from the perspective of their locality and allow users to define how zoomed in they want their view to be.</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p05m94hf.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p05m94hf.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p05m94hf.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p05m94hf.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p05m94hf.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p05m94hf.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p05m94hf.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p05m94hf.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p05m94hf.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p>If you enter or share your location, you鈥檒l get the latest 麻豆约拍 content about the location that matters to you. You can choose from between all locations in the UK, Channel Islands and Isle of Man (around 17,000 of them!). You will see the stories relevant for your selected location in the Latest Stories section 鈥 they are ordered based on publication time, with the most recent ones appearing at the top.</p> <p>You鈥檒l also see a selection at the top of the page of the stories that we think might interest you most, covering slightly wider areas around. To make it more personal and easier we are making it possible to save your chosen location for later, so when you return to the page, you will straight away access the content that is most relevant to you. Your location will be saved in cookies and shared between other 麻豆约拍 websites. Clearing cookies will remove this setting.</p> </div> <div class="component prose"> <p>We understand people can have different views of what is 鈥渓ocal鈥 so we have designed this new proposition to allow everyone to define the level of locality most suitable for them personally. If you are interested in what鈥檚 happening in the surrounding area you can extend the radius to bring in stories from further afield (up to a radius of 20 miles).</p> </div> <div class="component"> <img class="image" src="https://ichef.bbci.co.uk/images/ic/320xn/p05m947v.jpg" srcset="https://ichef.bbci.co.uk/images/ic/80xn/p05m947v.jpg 80w, https://ichef.bbci.co.uk/images/ic/160xn/p05m947v.jpg 160w, https://ichef.bbci.co.uk/images/ic/320xn/p05m947v.jpg 320w, https://ichef.bbci.co.uk/images/ic/480xn/p05m947v.jpg 480w, https://ichef.bbci.co.uk/images/ic/640xn/p05m947v.jpg 640w, https://ichef.bbci.co.uk/images/ic/768xn/p05m947v.jpg 768w, https://ichef.bbci.co.uk/images/ic/896xn/p05m947v.jpg 896w, https://ichef.bbci.co.uk/images/ic/1008xn/p05m947v.jpg 1008w" sizes="(min-width: 63em) 613px, (min-width: 48.125em) 66.666666666667vw, 100vw" alt=""></div> <div class="component prose"> <p>These changes are part of a wider piece of work designed to improve discovery and navigation around 麻豆约拍 content, as well as making the 麻豆约拍 more relevant to its audience, by offering more personalised products and services online. Other changes will follow and will see us moving away from our current local pages and towards a more refined service of personalised content.</p> <p>We are committed to continuing to improve the 麻豆约拍鈥檚 local news experience and we will be adding more features in the following months. We really welcome your <a href="http://www.bbc.co.uk/news/20039682" target="_blank">feedback</a>.</p> <p>You can find more about the new service <a href="/news/localnews/faqs" target="_blank">here</a>.</p> <p><em>Laura Ellis (Head of Digital, English Regions) and Karolina Iwaszko (Executive Product Manager, Design & Engineering).</em></p> </div>