<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Assero</title>
	<atom:link href="http://www.assero.co.uk/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.assero.co.uk</link>
	<description>Implementing Standards for the eClinical Vision</description>
	<lastBuildDate>Tue, 15 May 2012 09:42:57 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>What I Want, What I Really Really Want</title>
		<link>http://www.assero.co.uk/2012/what-i-want-what-i-really-really-want/</link>
		<comments>http://www.assero.co.uk/2012/what-i-want-what-i-really-really-want/#comments</comments>
		<pubDate>Thu, 03 May 2012 08:52:31 +0000</pubDate>
		<dc:creator>Dave IH</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[55 in 5]]></category>
		<category><![CDATA[CDISC]]></category>
		<category><![CDATA[CDISC SHARE]]></category>
		<category><![CDATA[CFAST]]></category>
		<category><![CDATA[data standards]]></category>
		<category><![CDATA[FDA]]></category>
		<category><![CDATA[SDTM]]></category>
		<category><![CDATA[Therapeutic Area]]></category>
		<category><![CDATA[Therapeutic Area Standards]]></category>

		<guid isPermaLink="false">http://www.assero.co.uk/?p=559</guid>
		<description><![CDATA[It is not often that you can use lyrics from a Spice Girls hit single for a title but the words do sum up rather well the contents of this post. In a previous post, I mentioned the development of 55 therapeutic area standards being undertaken at the request of the FDA by CDISC and [...]]]></description>
			<content:encoded><![CDATA[<p>It is not often that you can use lyrics from a Spice Girls hit single for a title but the words do sum up rather well the contents of this post. In a <a href="http://www.assero.co.uk/2012/churchill-the-fda-and-a-fall/" target="_blank">previous post</a>, I mentioned the development of 55 therapeutic area standards being undertaken at the request of the FDA by CDISC and the Critical Path Institute, the so called 55 in 5 or CFAST project. This initiative is designed to extend the coverage of the CDISC domains from the so-called safety domains into the therapeutic areas. More information of the areas being targeted can be found on the <a href="http://www.fda.gov/Drugs/DevelopmentApprovalProcess/FormsSubmissionRequirements/ElectronicSubmissions/ucm287408.htm" target="_blank">FDA website</a>.</p>
<p>So this is an interesting initiative. It is important. In fact it is very important but first the bad news. The 55 areas represent a massive piece of work. It equates to 11 areas per year assuming that each takes a year. It suggests significant staff numbers, 10 or 15, may be more, full time staff. The big danger I perceive is that there is a risk of developing 55 sets of standards that are not fit-for-purpose and that do not take into account the lessons we should have learnt from earlier initiatives. The good news is that this can be done well and to the benefit of all, if it is well thought out.</p>
<p>It is easy to criticize, so this post is my perspective by someone who a) has been using standards within a pharmaceutical company and wants to benefit from emerging standards so as to streamline the processes within that company; and b) as someone who helps develop new standards and has spent time looking at the structure of content standards.</p>
<p>To begin, and just as an example of the need, I will outline a couple of issues encountered within my current work.</p>
<p>The first is the Tanner Scale. This is a scale (1 to 5) of physical development in children, adolescents and adults. Now where do I place this within SDTM in terms of domain? I would immediately think Questionnaire (QS). But I have seen it placed in Subject Characteristics (SC). Which is right? SDTM experts will justify one or the other and that justification might be based on study specific criteria. But I want it to be obvious to someone who has only read the SDTM Implementation Guide, the “average” user in industry. Now one question does arise? Should a piece of content always be placed within the same domain within SDTM? From the FDA viewpoint this makes sense when trying to pool or integrate data. Does it make sense from the FDA perspective for other uses? From other scientific perspectives it may not. These issues seem rather important to me but I have not heard any debate on these topics. The important issue is the fact that I and others can debate it. And that debate does not bode well for a standard. We need precision.</p>
<p>A second example is the recent release of CDISC questionnaire terminology. Within this package were Test Codes, Test Names and Category Codes for a number of instruments. One in particular that I have worked with is the Work Productivity and Activity Impairment (WPAI) scale. Absent however within the terminology were the results scale and associated terminology; I know the results scale in this particular case is not complex but there is still an opportunity for the “average” user to get it wrong. And therefore while I and other companies can align our tests, we cannot align the result. Here there are gaps in the definitions for a given instrument.</p>
<p>I use these two simple examples so as to highlight the sort of information that needs to be defined and what we want from the therapeutic area development. So below is my specification of the TA needs. This is my set. Please feel free to comment, add or disagree. No one person has a monopoly on good thoughts and/or ideas.</p>
<p>I have split my thoughts into four parts, Scope, Methodology, Outputs and Training:</p>
<ul>
<li>Scope: what do we mean by therapeutic area? Is it a core of definitions, some meaningful subset? These simple questions then lead to thoughts about versions and increments. It strikes me that we will never be able to deliver the complete scope of an area in one release. Also with a large number of areas being worked upon in parallel, are we in danger of flooding the user community? Can the FDA cope with the flood? Scoping is therefore important as are the release cycles and mechanisms.</li>
<li>Methodology: We need a methodology that delivers consistent structuring of definitions. I also want to understand the methodology so that I can develop content using it. This allows me to develop consistent content that I could also feed it back to the user community in an immediately useable structure. We simply cannot afford to have differences in the structure of the definitions.<br />
Also, these definitions will, in the long run, need to be placed into a repository; the industry metadata repository. If definitions are not structured consistently then we will hit problems when we try and load them into such a repository. Also any tools developed to process the data based on the definitions will have problems with inconsistent structuring of definitions.</li>
<li>Tools: This in turn leads to the need for tools so as to manage the consistent structuring of the definitions but also to manage the sheer amount of work needed. The tools do not need to be complex and sophisticated. We can manage with MS Excel tools in the first instance to get us going quickly and ensure that definitions are structured according to a methodology that brings consistency across the therapeutic areas. What we need to do is decided quickly what the structures are, how best to represent them, some simple tools to help manage them to get us going. Tools can also validate the content checking for consistency and completeness; does the content make sense and have we have filled everything in.</li>
<li>Outputs: We must have high quality outputs
<ul>
<li>Consistent &#8211; The outputs across therapeutic areas must be consistent within themselves and across the areas. This is something that has not been the case in the past.</li>
<li>Complete &#8211; The outputs must be complete. We cannot have incomplete terminology or any other item for that matter. To have such results in a non-standard.</li>
<li>Useable &#8211; The outputs must be usable. To be useable they must include not only a human readable forms (e.g. PDF) but also a machine readable forms such that a user can read into, or transform into a form suitable for reading into, their systems so as to gain immediate benefit. These formats will also allow sharing. If we have machine readable forms the human readable forms (PDFs) can be automatically generated.</li>
</ul>
</li>
<li>Training &#8211; We need to train people in the methodology and the tools so as to deliver the attributes discussed above. Without such training we will have great variation in the structure and quality of the deliverables.</li>
</ul>
<p>None of this is easy. But clarity of thinking around the methodology and tools to support such will pay great dividends. They tools, in the first instance, do not need to be sophisticated. What we do need is the 10, 20 or whatever the number of people working on the project need to understand is what they are producing and why, they need to understand the methodology and be advocates for it.</p>
<p><em>The views expressed within this post are mine and do not represent the views of any organisation I volunteer for or client that Assero is employed by.</em></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.assero.co.uk/2012/what-i-want-what-i-really-really-want/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Churchill, the FDA and a Fall</title>
		<link>http://www.assero.co.uk/2012/churchill-the-fda-and-a-fall/</link>
		<comments>http://www.assero.co.uk/2012/churchill-the-fda-and-a-fall/#comments</comments>
		<pubDate>Thu, 26 Apr 2012 12:17:27 +0000</pubDate>
		<dc:creator>Dave IH</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[CDER]]></category>
		<category><![CDATA[CDISC]]></category>
		<category><![CDATA[FDA]]></category>
		<category><![CDATA[standards]]></category>
		<category><![CDATA[Therapeutic]]></category>
		<category><![CDATA[Therapeutic Area]]></category>
		<category><![CDATA[Therapeutic Area Standards]]></category>

		<guid isPermaLink="false">http://www.assero.co.uk/?p=539</guid>
		<description><![CDATA[“When I feel like exercising I just lie down until the feeling goes away.” According to the web this quote is from Robert M. Hutchins (1899 –1977) and according to Wikipedia, he was an educational philosopher, dean of Yale Law School, and president and chancellor of the University of Chicago. The quote was used by Bernard de [...]]]></description>
			<content:encoded><![CDATA[<p>“When I feel like exercising I just lie down until the feeling goes away.” According to the web this quote is from Robert M. Hutchins (1899 –1977) and according to Wikipedia, he was an educational philosopher, dean of Yale Law School, and president and chancellor of the University of Chicago.</p>
<p><span id="more-539"></span>The quote was used by Bernard de Bono from the European Bioinformatics Institute, during a question and answer session at the CDISC EuroInterchange that took place in Stockholm last week.</p>
<p>Bernard was talking in the context of creating a new term, a new piece of terminology, a new definition. His plea was don’t do it and hence the quote, go and lie down until the feeling passes. His argument was that, if you do create the new term you are a) saying that this is some new knowledge because no one has defined it before and b) you are creating a maintenance problem.</p>
<p>I am going to leave the first point; I shall leave that for another day. What I want to focus on is the notion of the creation of the maintenance issue. Since the New Year, it has become apparent to me that CDISC has, over the last year or two, hit a new issue. I have discussed the issue with a few folks over the past two months that the standards are now mainstream; as Chuck Cooper from CDER stated at the interchange, 40% of electronic submissions now use SDTM. This, however, means that the standards now need to be maintained. Exhibit one, <a title="CDISC SDTM Amendment 1" href="http://www.cdisc.org/stuff/contentmgr/files/0/3fa5f30f40ce5ecc7b3f91e558b55f73/misc/amendment_1_to_study_data_tabulation_model_v1_2_and_sdtmig_v3_1_2_final_2011_12_14.pdf" target="_blank">Amendment 1 to SDTM</a>, Exhibit two the <a title="FDA Draft Guidance" href="http://www.fda.gov/downloads/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/UCM292334.pdf" target="_blank">CDER guide on standards</a>. And this is where we hit the big problem. Volunteers and maintenance don’t mix too well.</p>
<p>Another challenge for CDISC is the simple amount of work it is attempting to undertake. This was witnessed by the Gantt chart presented by Wayne Kubick – the new CDISC CTO – that lists 23 major areas of work. In fact the list is probably longer as some of the items contain multiple projects and the list includes only 10 of the 55 therapeutic areas that has been mentioned as being tackled over the coming five years.</p>
<p>I discussed this with a few people at the interchange. The therapeutic area development is a major challenge – more on this in a future posts – but the one issue that came out of the conversations is that are we simply setting ourselves up for a fall. By trying to do too much are we simply bound to fail? And by failing do we damage the good work achieved to date?</p>
<p>There are no easy answers. Is it that CDISC now needs a new business model? Is it time that we have to accept that CDISC needs more resources? Do we have to give up on the idea of volunteer led standards development?</p>
<p>In a decade or so CDISC and the industry have come a long way. 40% standardised electronic submissions is no mean feat. As I noted above, the session within which Bernard made his quote was a panel, question and answer session. I always feel for the panel in these circumstances. I have been there, you can never win. The questions are always of the negative nature. Why have you not done that? When are you doing this? Why did you do that? We focus on the negative. This post is focusing on the negative in that it is looking at the challenge ahead. However, we also need to be realistic.</p>
<p>I believe that Bernard attributed the exercise quote to Winston Churchill. If I have got this wrong then apologies to Bernard but when I looked on the wonderful web it seemed to be from Robert Hutchins. But if I may borrow and plagiarise another Churchill quote: Never in the field of standards development has so much been owed by an industry to so few volunteers. CDISC has achieved so much. Let us not waste it by trying to do too much in the future without the required resource.</p>
<p><em>And one final note. The views expressed within this post are mine and do not represent the views of any organisation I volunteer for or client that Assero is employed by.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.assero.co.uk/2012/churchill-the-fda-and-a-fall/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Mad March and the FDA</title>
		<link>http://www.assero.co.uk/2012/mad-march-and-the-fda/</link>
		<comments>http://www.assero.co.uk/2012/mad-march-and-the-fda/#comments</comments>
		<pubDate>Sun, 18 Mar 2012 14:00:19 +0000</pubDate>
		<dc:creator>Dave IH</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[CDISC]]></category>
		<category><![CDATA[CDISC SHARE]]></category>
		<category><![CDATA[DIA]]></category>
		<category><![CDATA[eClinical]]></category>
		<category><![CDATA[FDA]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[PhUSE]]></category>
		<category><![CDATA[standards]]></category>
		<category><![CDATA[Working Groups]]></category>

		<guid isPermaLink="false">http://www.assero.co.uk/?p=526</guid>
		<description><![CDATA[It has been a busy few weeks for both work and personal reasons. For those who know us, it is lovely to announce that Amanda (AJ) and I got married on the first day of March. Some pictures from a friend can be found here and many thanks from us both to those who have sent [...]]]></description>
			<content:encoded><![CDATA[<p>It has been a busy few weeks for both work and personal reasons. For those who know us, it is lovely to announce that Amanda (AJ) and I got married on the first day of March. Some pictures from a friend can be found <a href="https://www.facebook.com/media/set/?set=a.10150614294440369.384523.682320368&amp;type=3&amp;l=4f545018f6" target="_blank">here</a> and many thanks from us both to those who have sent kind wishes.</p>
<p>So if that did not make March frantic enough, I find myself sitting on a plane heading to the US for an exciting week of volunteering. The major event of the week is the <a href="http://www.phuse.eu/CSS.aspx" target="_blank">FDA /PhUSE Computational Science Symposium</a> with the second half of the week being taken up with a CDISC Technical Leadership Committee (TLC) meeting and a CDISC SHARE team face-to-face meeting.</p>
<p>I was surprised and rather honoured – this is English understatement – to be invited to participate in the PhUSE meeting. In the past two years, the DIA have run the event but it has only really manifested as a two day meeting at which people gathered and after two days left. With this event, the FDA is very much driving the idea that the two day gathering is an initial kick off meeting with working groups continuing their work going forward through 2012 and into 2013.</p>
<p>At the current time six working groups have been set up and have undertaken some preliminary work with the real work starting over the next two days:</p>
<ul>
<li>Working Group 1 Validation – focusing on collaborating to develop a robust process to rapidly validate and assess data quality as data moves through the product life cycle across both industry and regulatory review.</li>
<li>Working Group 2 Site Selection – continuing the ongoing work developing standards for the data set used in a risk model designed to improve the quality of clinical inspection site selection for drug trials.</li>
<li>Working Group 3 Integration – looking at the challenges of analyzing and integrating data from retrospective studies from both an industry and regulatory perspective</li>
<li>Working Group 4 CDISC Implementation – looking at specific challenges with implementing the CDISC data standards</li>
<li>Working Group 5 Standard Scripts – focusing on standard scripts for data transformations and analyses across and/or within a therapeutic area and exploratory analysis.</li>
<li>Working Group 6. Non-Clinical – discussing various approaches to the standardization of nonclinical data and the possible implications to the development and implementation of data standards.</li>
</ul>
<p>The meeting has sold out with 300 registrations being accepted by PhUSE for the event. It should be an interesting event and it will be exciting to see the work that will emerge from the working groups.</p>
<p>I will endeavour to tweet news over the two days.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.assero.co.uk/2012/mad-march-and-the-fda/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Metadata and Layers</title>
		<link>http://www.assero.co.uk/2012/metadata-and-layers/</link>
		<comments>http://www.assero.co.uk/2012/metadata-and-layers/#comments</comments>
		<pubDate>Wed, 08 Feb 2012 07:09:10 +0000</pubDate>
		<dc:creator>Dave IH</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[CDISC]]></category>
		<category><![CDATA[CDISC SHARE]]></category>
		<category><![CDATA[data standards]]></category>
		<category><![CDATA[eClinical]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.assero.co.uk/?p=492</guid>
		<description><![CDATA[I have been wandering around over the last six months or so with a couple of pictures in my head that I have at last managed to commit to paper. The pictures in question try and explain why we would want to layer our metadata and through the layers link data standards and, ultimately, why [...]]]></description>
			<content:encoded><![CDATA[<p>I have been wandering around over the last six months or so with a couple of pictures in my head that I have at last managed to commit to paper. The pictures in question try and explain why we would want to layer our metadata and through the layers link data standards and, ultimately, why the line between those standards may well begin to grey and ultimately may disappear altogether.</p>
<p>So lets start by considering the writing of a protocol. Within the protocol document today, we typically see statements such as “Demographics (age, gender, height and weight)” or “The following baseline demographics and disease characteristics will be captured in the eCRF: Sex, age, body weight, height/length, and race.” Through statements such as these the protocol writer is giving us the first indication of what data is to be captured as part of the study but at a high level using the name of a “concept” such as AGE, SEX and so on.</p>
<div id="attachment_496" class="wp-caption alignright" style="width: 310px"><a href="http://www.assero.co.uk/wp-content/uploads/2012/02/2012-02-06-Layed-Metadata-1.jpg"><img class="size-medium wp-image-496" title="Metadata Flow" src="http://www.assero.co.uk/wp-content/uploads/2012/02/2012-02-06-Layed-Metadata-1-300x197.jpg" alt="" width="300" height="197" /></a><p class="wp-caption-text">Figure 1 - The flow of metadata</p></div>
<p>The reason I put the word concept in quotes is that this single word conveys many different meanings to many people, it is desperately overloaded. What I mean when using the term is a collective designator for one or more atomic items of data that in combination represent some meaningful notion or idea. AGE is a simple example. It is meaningful to most people, consists of value and units and we can provide a good definition for the concept.</p>
<p>So in the first diagram, I see the initial use of the concept within the protocol simply via use of its name. That protocol document is then converted into a study implementation that consists of a set of paper or electronic CRFs – it should be noted that I don’t distinguish the usefulness of metadata simply because of the use of paper CRFs, they need to follow the same rules – with the CRF expanding the concept into its constituent parts.  In addition to the basic definition we will also be interested in how the data are to be captured, how the units are to be presented and other such considerations.</p>
<p>In my simple example AGE, as I said, will consist of two parts, the AGE value and the associated AGE UNITS and we will need to capture these items on a CRF. Now most of the time the units will be fixed, years most likely, may be months but we still need units to make sense of the value. I am of the school that likes to have the variable hidden within the system even if the user cannot change it simply for the purposes of consistency across studies when data are exported. And obviously this data will be collected for the N subjects within the study, hence the multiple CRFs in figure 1.</p>
<p>Once we have our data collected, we will want to take the values into a tabulation with columns holding the AGE and AGE UNITS values from the N subjects. As we move into the world of the tabulation from that of the CRF, we will need to think about the columns and definitions in relation to the dataset as a whole and information about the dataset itself such as its name.</p>
<div id="attachment_494" class="wp-caption alignleft" style="width: 310px"><a href="http://www.assero.co.uk/wp-content/uploads/2012/02/2012-02-06-Layed-Metadata-2.jpg"><img class="size-medium wp-image-494" title="The Common Parts" src="http://www.assero.co.uk/wp-content/uploads/2012/02/2012-02-06-Layed-Metadata-2-300x196.jpg" alt="" width="300" height="196" /></a><p class="wp-caption-text">Figure 2 - Extract The Common Parts</p></div>
<p>So what we see is that in each part of the above chain – I stopped at the tabulation rather than adding further use cases such as analysis dataset, warehousing, data aggregation etc –  there is a common set of information used within each use case but that each use case also requires its own particular metadata. And for these reasons, I created the second diagram and show in figure 2 that we would like to extract that common core definition and bring it into one place to be shared across the use cases.</p>
<p>This then leads to a desire to organize the metadata in a layered fashion as shown in figure 3. In this simplified view, we want to put at the base of our metadata &#8220;stack&#8221; our code lists &#8211; the controlled terminology &#8211; and then in the layer above the definition of  our concept. This provides our core building blocks, a definition that is independent of the use case but consistent across all use cases.</p>
<p>On top of these two lower levels, we can then layer on top the extra metadata applicable to the use case, the CRF for example with the presentation information and such items as range/edit checks that we would need to assist in collecting data. For the tabulation, we want to add precision around the column definitions, may be formats, whether a value is required or is optional, rules for missing values etc.</p>
<div id="attachment_495" class="wp-caption alignright" style="width: 310px"><a href="http://www.assero.co.uk/wp-content/uploads/2012/02/2012-02-06-Layed-Metadata-3.jpg"><img class="size-medium wp-image-495" title="The Layers" src="http://www.assero.co.uk/wp-content/uploads/2012/02/2012-02-06-Layed-Metadata-3-300x181.jpg" alt="" width="300" height="181" /></a><p class="wp-caption-text">Figure 3 - The Metadata Layers</p></div>
<p>One thing to note about the diagram is the warehouse and that I have placed the Study Configuration linked to the warehouse. In reality, the Study Configuration will be created as part of the study build process earlier in the life-cycle but my thought was that the use cases that best illustrate its relevance in the metadata world is the desire to aggregate data across studies. This need drives the requirement that we know what is on each of those studies in the sense of the study construction all the way down to concepts, variables and code lists. Hence my decision to draw it there for the sake of an uncluttered picture.</p>
<p>Finally the submission and review, where the metadata needs to be passed to the regulatory authority, define.xml is one such item but annotated CRF and other such items are also applicable.</p>
<p>I have tried to keep this short and sweet but get the message across. We should be organising our metadata. It needs to be well structured.</p>
<p>It is good to be back writing again, hopefully a few more blogs will flow over the coming weeks.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.assero.co.uk/2012/metadata-and-layers/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>eSource and a Patent</title>
		<link>http://www.assero.co.uk/2011/esource-and-a-patent/</link>
		<comments>http://www.assero.co.uk/2011/esource-and-a-patent/#comments</comments>
		<pubDate>Thu, 17 Nov 2011 11:32:00 +0000</pubDate>
		<dc:creator>Dave IH</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[CDISC]]></category>
		<category><![CDATA[clinical trials]]></category>
		<category><![CDATA[electronic source]]></category>
		<category><![CDATA[electronic source data]]></category>
		<category><![CDATA[EMA]]></category>
		<category><![CDATA[EMEA]]></category>
		<category><![CDATA[eSDI]]></category>
		<category><![CDATA[eSource]]></category>
		<category><![CDATA[FDA]]></category>
		<category><![CDATA[ODM]]></category>

		<guid isPermaLink="false">http://www.assero.co.uk/?p=463</guid>
		<description><![CDATA[A few days ago I was alerted to a discussion thread on LinkedIn by a good friend. I started reading and became increasingly alarmed by what I found. The thread can be found here. Please read the thread. My concern is based around a commercial organization taking out a patent on work that I consider [...]]]></description>
			<content:encoded><![CDATA[<p>A few days ago I was alerted to a discussion thread on LinkedIn by a good friend. I started reading and became increasingly alarmed by what I found. The thread can be found <a href="http://www.linkedin.com/groupItem?view=&amp;gid=1066557&amp;type=member&amp;item=79202303&amp;qid=de1d1e4a-70da-4e96-bfcc-c0fc36134c93&amp;trk=group_most_popular-0-b-ttl&amp;goback=%2Enmp_*1_*1_*1_*1_*1_*1%2Egmp_1066557" target="_blank">here</a>. Please read the thread.</p>
<p>My concern is based around a commercial organization taking out a patent on work that I consider to be already in the public domain. What is worse is that I, and many others, worked hard to not only produce the idea but also ensure that the work was placed into the public domain with no restrictions on who could use it.</p>
<p>My story begins in 2000 and before working with eDiaries within clinical trials. When working with customers, about the fourth question they asked was, – this is after the “is it part 11 compliant” question – &#8220;Where is the source data?&#8221; At that time, I did not have an answer but it interested me. So, when in 2002, I had some time, I started reading everything I could find on the regulations about source data. I remember reading papers by Paul Bleicher, then at Phase Forward, and by Steve Raymond of PHT fame and discussing the issues with these learned folks at DIA meetings. In 2003, I wrote a paper for the peer-reviewed, Applied Clinical Trials. In order to get that article past peer-review, I remember having to remove a suggestion that the basis of an eSource Data Store should be the CDISC Operational Data Model (ODM). Apparently, the suggestion of using standards at that time was considered too controversial! The <a href="http://appliedclinicaltrialsonline.findpharma.com/appliedclinicaltrials/article/articleDetail.jsp?id=84818" target="_blank">article was published in February of 2004</a>.</p>
<p>By this time, I was working with CDISC furthering the cause of the ODM with the FDA. I have just looked back at presentations that I gave with Dr Rebecca Kush at the FDA in late 2004, extolling the virtues of the ODM and starting to discuss eSource. In late 2004 the FDA invited CDISC to look at the question of eSource and the existing regulations. I was fortunate enough to co-lead the team and, in November of 2006, the final version (after six drafts, four of which went out for public review) of the eSDI paper was published.</p>
<p>While working on the eSDI paper, CDISC were attacked for having the audacity to tackle the subject and there were those on the CDISC Board who questioned whether CDISC should be dealing with such a topic. But in the end the work was well worth it because that document lays the foundation for the use of eSource within clinical trials, irrespective of technology. By developing the 12 principles that are described within the document, any solution can be assessed and seen to meet the regulatory need or not. The paper also presented five potential architectures.</p>
<p>Then came something that I am proud of. The EMA decided to base their eSource guidance on the eSDI work and by doing so, I believe, improved on the eSDI content. The FDA also updated the Computerized Systems Used in Clinical Trials (CSUCT) document to become the CSUCI guidance document and put a toe into the eSource waters. Unfortunately the FDA then released a draft eSource guidance and failed to follow the EMA’s lead by utilizing the CDISC paper.</p>
<p>In parallel with all of this, I was working with Landen Bain as he led the charge to look at practical – and I stress the practical – ways to integrate Electronic Health Records with Clinical Research. Through Integrating the Healthcare Enterprise (IHE), he led the work on the RFD profile that came to fruition in 2007, with several demonstrations of capability at the HIMSS conference in New Orleans. Included in that demonstration was the first incarnation of an eSource Archive based upon the CDISC ODM. This demonstration turned into a formally developed system that has been used in subsequent IHE connectathons and HIMSS events both in the US and Europe. It has been used in several demonstrations of eSource capability to the FDA and has been demonstrated at the 2011 DIA Annual conference. That eSource tool contains the ability to use ODM as an eSource Document, read them, it has audit trails, user roles and leverages the ODM’s metadata to provide meaningful displays of the data and audit trail.</p>
<p>The picture I am painting is one of several threads of activity, all undertaken within open standards development organizations since 2004, with many volunteers and organizations giving their time feely and working in a spirit of cooperation to develop solutions that are for the benefit not only for those giving their time but also for the benefit of clinical research organizations (in the most generic sense), regulatory agencies and ultimately the public with the provision of better and safer products.</p>
<p>I looked back through many presentations that I have given on eSource. Within these presentations are ideas for system topologies, descriptions of the issues and many suggestions. Audiences include the FDA, EMA GCP inspectors and DIA meetings in both the US and Europe.</p>
<p>I am deeply saddened by the discussion in the thread. Having put in many hours working with many others who have also made significant contributions to bring clarity to a difficult subject for the benefit of all, it is galling to consider that an organization can patent ideas where there appears to be a huge overlap with the work done by CDISC and IHE. I have yet to see anything within this patent &#8211; I am happy to be proved wrong -  that is not covered by the work undertaken by many volunteers since the first steps back in 2004.</p>
<p>Here are just a few of the presentations across the year&#8217;s to give a flavour</p>
<h4>DIA Euro Meeting, Lisbon, March 2005</h4>
<p>Slides 37 through to 43 inclusive discuss the Trusted Third Party</p>
<iframe src="http://www.slideshare.net/slideshow/embed_code/10190673" width="650" height="526" frameborder="0" marginwidth="0" marginheight="0" scrolling="no"></iframe><br/><br/>
<h4>EMEA, London, June 2006</h4>
<p>Slides 49 &amp; 50 are interesting. They present the generic eSource archive/store architecture.<br />
<iframe src="http://www.slideshare.net/slideshow/embed_code/10190300" width="650" height="526" frameborder="0" marginwidth="0" marginheight="0" scrolling="no"></iframe><br/><br/></p>
<h4>DIA CDM Meeting, Florida, March 2007</h4>
<p>Slides 22 to 25 show the relationship between the IHE RFD profile, eSource and a Trusted Third Party (the Archiver)</p>
<iframe src="http://www.slideshare.net/slideshow/embed_code/10197640" width="650" height="526" frameborder="0" marginwidth="0" marginheight="0" scrolling="no"></iframe><br/><br/>
]]></content:encoded>
			<wfw:commentRss>http://www.assero.co.uk/2011/esource-and-a-patent/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The FDA and a Round World</title>
		<link>http://www.assero.co.uk/2011/the-fda-a-round-world/</link>
		<comments>http://www.assero.co.uk/2011/the-fda-a-round-world/#comments</comments>
		<pubDate>Tue, 26 Jul 2011 07:12:16 +0000</pubDate>
		<dc:creator>Dave IH</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[ADaM]]></category>
		<category><![CDATA[CDASH]]></category>
		<category><![CDATA[CDISC]]></category>
		<category><![CDATA[data standards]]></category>
		<category><![CDATA[eClinical]]></category>
		<category><![CDATA[FDA]]></category>
		<category><![CDATA[SDTM]]></category>

		<guid isPermaLink="false">http://www.assero.co.uk/?p=451</guid>
		<description><![CDATA[The term “round data” conjures up many images. Coined, I believe, by the FDA about three years ago, there is no precise definition of this term. If I asked five people what does “round data” mean, I would probably get six opinions, some abuse and I would probably be reluctant to ask the question ever [...]]]></description>
			<content:encoded><![CDATA[<p>The term “round data” conjures up many images. Coined, I believe, by the FDA about three years ago, there is no precise definition of this term. If I asked five people what does “round data” mean, I would probably get six opinions, some abuse and I would probably be reluctant to ask the question ever again.</p>
<p>So it is for those reasons that this is one of those difficult blog posts to write simply because the subject matter is one that has caused quite a bit of confusion, as well as being one that has resulted in a releasing of emotions in the past. So either this post will result in a flurry of comments or a deathly silence.</p>
<p>I cannot start off by giving that wonderfully simple definition. Wouldn’t it be wonderful to simply say “round data is …” however. I can’t. But I believe the vision of round data, for that is what it is really, a vision. I believe round data is all about context. I have taken part in many conversations about the CDISC standards and context and I have even understood some of them. In the current standards, you will find that there is a lot of implicit context or assumptions. It is these assumptions that get us into trouble because implementation of the same standard will undoubtedly make different assumptions and thus we don’t have a standard, we have two similar implementations.</p>
<p>Consider the value, 180. That one item of information tells you nothing. If I add the units “cm” so as to make 180 cm, you now know it is a dimension. Given the magnitude you might make an assumption it is a height. Now if I add HEIGHT, to build HEIGHT=180 cm I am beginning, but only beginning, to add the context around the recording – or observation – of the height.</p>
<p>But more questions immediately emerge. When did I observe, why did I observe, what happened before and what happened after? So the observation needs to be placed into the context of the method of collection, the collection instrument, be it a case report form or a medical record both of which may be paper or electronic. In the event of the case report form that probably exists within the context of a study and thus a protocol. So as you start to see from this very simple picture of our clinical research world, the context of a simple observation becomes increasingly complex.</p>
<p>What we strive for is to place the observation into the right context. A human does that very well. You or I can read the protocol, examine the annotated CRF and look at the resulting dataset. But if we want to get more from our data, to be easily able to aggregate across studies, we need the computer, the machine, to have access to that same context. Some of the things we might wish to see are:</p>
<ul>
<li>Relationship between a clinical observation and the planned observations, did we capture everything intended? Did we capture more? Which observation is which?</li>
<li>Parent child relationships, for example, these multiple results came from a single sample</li>
<li>Relating adverse events to interventions</li>
<li>Terminology relationships</li>
</ul>
<p>We have some of these relationships today but they are, as I have said, implicit. The value and units of our observation are related but only by an implicit positional relationship because the two columns are next to each other in the dataset and the data specification (human readable only) tells me what those two columns contain. We need to make those relationships tangible. SDTM has gone a long way to achieving this but suffers from only providing them in a human-readable form and not having the scope of relationships that we may wish to see in the future.</p>
<p>We therefore need to build on SDTM and all the other good work to get to the round world. But the relationships extend across a single standard. Consider the issue of traceability of submitted data back to the case report form. The relationships in this scenario encompass not only SDTM but CDASH and potentially ADaM.</p>
<p>So a definition for round data? I would suggest round data as being nothing more than a richness of relationships. The challenge is defining the relationships we want, how best to describe them and how to transmit them.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.assero.co.uk/2011/the-fda-a-round-world/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Blogs &amp; eSource: some ponderings</title>
		<link>http://www.assero.co.uk/2011/blogs-esource-some-ponderings/</link>
		<comments>http://www.assero.co.uk/2011/blogs-esource-some-ponderings/#comments</comments>
		<pubDate>Mon, 11 Jul 2011 13:30:59 +0000</pubDate>
		<dc:creator>AJ</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[CDISC]]></category>
		<category><![CDATA[DIA]]></category>
		<category><![CDATA[eClinical]]></category>
		<category><![CDATA[eSource]]></category>
		<category><![CDATA[FDA]]></category>
		<category><![CDATA[social media]]></category>

		<guid isPermaLink="false">http://www.assero.co.uk/?p=412</guid>
		<description><![CDATA[Over the last few months, Dave and I have been working hard and as a consequence our blogging has not been as prolific as it has been in past months. However, like all good ‘bloggers’ we continue to read the blogs of others with fervour. So I have been intrigued by a few posts in [...]]]></description>
			<content:encoded><![CDATA[<p>Over the last few months, Dave and I have been working hard and as a consequence our blogging has not been as prolific as it has been in past months. However, like all good ‘bloggers’ we continue to read the blogs of others with fervour. So I have been intrigued by a few posts in recent weeks that I thought I would draw your attention to, either because like me you are interested in the power of social media or because as the biopharmaceutical industry moves forward, social media goes with it.</p>
<p>First of all I thought I’d draw your attention to <a title="John Halamka" href="http://geekdoctor.blogspot.com/2011/07/can-blogging-be-harmful-to-your-career.html">John Halamka’s blog</a>. It’s an interesting mix of highly technical records of the current developments in the HIT world (which at times seems mind boggling!) and government policy, along with some new and interesting technology observations and some very personal revelations about his family. I don’t think I have ever read anything controversial or even remotely tricky in his Blogs (which in my opinion is not the purpose of a Blog, it should always raise questions and invite comments), however to read that he has had his Blog used against him was rather surprising. Blogs and other forms of social media are public documents that are there to engage an audience; they are snapshots of thoughts and ideas that help to create a better understanding of our industry. I think we need to be careful about what we say but not so careful that freedom of expression and good debate are squandered for dry renditions of current developments that don’t interest anyone! Most criticism seems to come from those who either don’t write blogs themselves or can’t imagine how anyone who is doing any work has time for such linguistic efforts.</p>
<p>So moving on, CDISC have been producing some interesting <a title="CDISC Blog" href="http://www.cdisc.org/cdisc-blog" target="_blank">Blogs</a> about the Interoperability Showcase at the recent DIA Annual meeting in Chicago (<a title="DIA Interoperability Showcase Webinar" href="https://diahome.webex.com/ec0605ld/eventcenter/recording/recordAction.do;jsessionid=xhrpTh0VGqbpL25Yyp6Tv9VK2SvFYTQ1Q7zY9Q9DmBSMnCGRSv9t!-808378119?theAction=poprecord&amp;actname=%2Feventcenter%2Fframe%2Fg.do&amp;apiname=lsr.php&amp;renewticket=0&amp;renewticket=0&amp;actappname=ec0605ld&amp;entappname=url0107ld&amp;needFilter=false&amp;&amp;isurlact=true&amp;entactname=%2FnbrRecordingURL.do&amp;rID=48060317&amp;rKey=8a917bc6305aee90&amp;recordID=48060317&amp;rnd=1441953299&amp;siteurl=diahome&amp;SP=EC&amp;AT=pb&amp;format=short">There&#8217;s a good webinar about it here</a>). For Dave and I, this has a personal interest as we are an integral part of the Archiving solution for the <a title="RFD" href="http://wiki.ihe.net/index.php?title=Retrieve_Form_for_Data_Capture">Retrieve Form for Data (RFD)</a> capture profile (developed by IHE)<a title="IPL" href="http://www.ipl.com/" target="_blank"> along with our partners IPL</a>. From the first moment Dave and I started working together, he had a deep conviction about the way that an electronic archiver could solve many of the ‘paper’ based challenges for research and also that having an archiver meant that health records did not have to be 21 CFR Part 11 compliant (<a title="FDA 21 CFR Part 11" href="http://www.accessdata.fda.gov/scripts/cdrh/cfdocs/cfcfr/cfrsearch.cfm?cfrpart=11" target="_blank">This is a fairly weighty but important tome which you can find here</a>). Dave and Dr Kush (CEO CDISC) worked together on the highly influential <a title="eSource Data" href="http://www.assero.co.uk/standards-for-eclinicial/electronic-source-data/" target="_blank">eSDI document</a> that has been referenced by EMA and I was extremely privileged to be at White Oak when they met with the FDA to give some clarity around the 12 scenarios. You can find out more about the archiver<a title="Contact Us" href="http://www.assero.co.uk/contact-us/"> by contacting us</a> and of course we are more than happy to explain how important it is to the success of RFD. One of the key benefits of the archiver however is the notion of traceability. When the regulator asks to see the source of your data 7 years after you started your study, can you find it easily? Is the physician’s office where you conducted the original research still there? Is the CD of the paper report forms still in existence and does it still work? What if you are looking for data to look for new indications for a therapy? Can you quickly retrieve all the original data? If you don’t have a central source, this can be tricky at best, time consuming at worst or perhaps a complete disaster if you can’t find the data at all. Another important consideration is that while there are disparate electronic health records and systems that have been implemented globally, the RFD solution is the best possible way to capture research data in order to develop that truly electronic trial. This is something Assero is passionate about (in fact Dave has written about it<a title="Searching for an eClinical Vision?" href="http://www.assero.co.uk/2011/searching-for-an-eclinical-vision/"> here in a recent blog</a> and our main page on <a title="Standards for eClinicial" href="http://www.assero.co.uk/standards-for-eclinicial/">Standards for eClinical also outlines what we do</a>). It was a shame that Dave and I were simply too busy to attend the Showcase and the Annual but I am hoping a certain Landen Bain will be bringing his great carnival to Europe next year, as after all we have electronic health records here too and it would be interesting to get a discussion going this side of the pond!</p>
<p>Finally, I know that the on-going debate about social media for research has moved on from when <a title="Social Media for Clinical Research Data?" href="http://www.assero.co.uk/2011/social-media-for-clinical-research-data/">I wrote one of my previous blogs</a>, there has been news about Pfizer using social media (or an interactive method for collecting data)  and there is a good blog here about the relationships involved and participation in trials<a title="Pfizer Interactive Trial" href="http://rebarinteractive.com/pfizer-home-clinical-trial/" target="_blank"> where you don’t necessarily talk to a  physician</a>. It makes for an interesting read and, as I suspect the industry seeks better data and compliance by patients, new methodologies will be sought to capture data of that I am sure. So watch this space as the debate rumbles on and don&#8217;t be shy, let us know your thoughts.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.assero.co.uk/2011/blogs-esource-some-ponderings/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Educating the Ontologically Challenged, Part 2</title>
		<link>http://www.assero.co.uk/2011/educating-the-ontologically-challenged-part-2/</link>
		<comments>http://www.assero.co.uk/2011/educating-the-ontologically-challenged-part-2/#comments</comments>
		<pubDate>Fri, 01 Jul 2011 13:55:51 +0000</pubDate>
		<dc:creator>Dave IH</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Ontology]]></category>
		<category><![CDATA[OWL]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[Semantic]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[Semantics]]></category>

		<guid isPermaLink="false">http://www.assero.co.uk/?p=402</guid>
		<description><![CDATA[At last I get the chance to write the first answer to the series of questions I, along with Kerstin, posed in the first blog post in this series The question was “What is the semantic web and linked data?” So first of all Kerstin suggested that I read an article  An Intro To The [...]]]></description>
			<content:encoded><![CDATA[<p>At last I get the chance to write the first answer to the series of questions I, along with Kerstin, posed in the first <a href="http://www.assero.co.uk/2011/educating-the-ontologically-challenged/" target="_blank">blog post i</a>n this series</p>
<p>The question was “What is the semantic web and linked data?” So first of all Kerstin suggested that I read an article <em> <a href="http://www.webcentralstation.ca/2011/02/08/an-intro-to-the-semantic-web-why-you-need-to-know-about-it-sooner-than-later/" target="_blank">An Intro To The Semantic Web: Why You Need To Know About It Sooner Than Later</a></em><em> </em>and watch a video <em><a href="http://www.youtube.com/watch?v=GKfJ5onP5SQ" target="_blank">Linked Data (and the Web of Data), a 4-minute video from Ireland’s Digital Enterprise Research Institute (DERI)</a>. </em><a href="http://www.youtube.com/watch?v=GKfJ5onP5SQ"><em></em></a><em><br />
</em></p>
<p>I did not feel that either gave me that one sentence answer I was seeking to the question posed. I wanted something short and sweet. Kerstin and I had a chat via a web conference, she sitting in a café in Goteborg with me in the UK, the power of the internet allowing us to connect and discuss. Around the same time Kerstin posted a link via twitter (you can follow her via @kerfors) to a presentation by<a href="http://www.slideshare.net/juansequeda/introduction-to-linked-data-8223364" target="_blank"> Juan Sequeda on Slide Share</a>. So I looked and read. Fortunately this particular slide set and explanation made far more sense to me.</p>
<p>I also did that thing we all do when faced with explaining some concept to people when we ourselves do not really know the answer, we step back to first principles. We used to reach for the dictionary on the shelf, now we go to the technological wonder that is Wikipedia.</p>
<p style="padding-left: 30px;"><em>The Semantic Web is a &#8220;web of data&#8221; that enables machines to understand the </em><em>semantics</em><em>, or meaning, of information on the </em><em>World Wide Web</em><em>.</em></p>
<p>After digesting these two additional sources, in particular the presentation by Juan Sequeda, the light bulb in my brain started to glow, a little dimly may be, but glow nonetheless. My conclusion?</p>
<p style="padding-left: 30px;"><em>While the web we know today is a series of documents designed for human consumption, the semantic web wants to permit access to all the data across the web and allow machines to process it.</em></p>
<p>Now Kerstin changed this in the draft of the article to:</p>
<p style="padding-left: 30px;"><em>While the web we know today is a series of documents designed for human consumption, the semantic web wants to <span style="text-decoration: underline;">make data ready for automation where machines can directly process it and hence give humans new insights</span></em></p>
<p>But there may be something missing. And it is that word semantic in “Semantic Web”. We also want to understand the data, get at the meaning of the data. So my final version of a description for the semantic web:</p>
<p style="padding-left: 30px;"><em>While the web we know today is a series of documents designed for human consumption, the semantic web wants to expose the data, allow the meaning of the data to be understood and permit the data to be consumed by machines so as to give humans new insights about the data</em></p>
<p>To allow the machine access, we need a mechanism by which data sources can describe and expose their data. This is not something I want to discuss in this post but we will certainly be returning to this topic in subsequent ones. Now, if two sites publish their data and there is a common component within those data (some part is common), then the two sets of data can be linked. And thus we have linked data.</p>
<p style="padding-left: 30px;"><em>The semantic web is a way of publishing data, linked data is the process of linking it together.</em></p>
<p>One final thought that crossed my mind as I was reading through the materials referenced above is about the definition of the data. The semantic web seems, and I stress this is an initial impression bred from ignorance, to be built on less than precise definitions. This is another question we will look into in subsequent posts.</p>
<p>So our set of questions can now be extended to:</p>
<ol>
<li>What is      the semantic web and linked data? <em>See      this blog post</em></li>
<li>What is      an ontology?</li>
<li>Where do      RDF and OWL fit?</li>
<li>How do I      link my data?</li>
<li>How      precise are the definitions?</li>
<li>What      could these things, if they are relevant , bring to a metadata repository?</li>
<li>What do      they mean to clinical trial data, what is the benefit, what is in it for      me?</li>
<li>What can      other industries teach us?</li>
</ol>
<p>Hopefully we will reach then end of the questions rather than just generate new ones!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.assero.co.uk/2011/educating-the-ontologically-challenged-part-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Searching for an eClinical Vision?</title>
		<link>http://www.assero.co.uk/2011/searching-for-an-eclinical-vision/</link>
		<comments>http://www.assero.co.uk/2011/searching-for-an-eclinical-vision/#comments</comments>
		<pubDate>Thu, 16 Jun 2011 05:43:17 +0000</pubDate>
		<dc:creator>Dave IH</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[CDISC]]></category>
		<category><![CDATA[data standards]]></category>
		<category><![CDATA[eClinical]]></category>
		<category><![CDATA[FDA]]></category>
		<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://www.assero.co.uk/?p=362</guid>
		<description><![CDATA[I am just preparing the first blog covering my journey in to the semantic web and that should be with you later this week or early next week. However, something that has cropped up a few times in recent weeks is the question of vision. In more definite terms, what should we be doing as [...]]]></description>
			<content:encoded><![CDATA[<p>I am just preparing the first blog covering my journey in to the semantic web and that should be with you later this week or early next week. However, something that has cropped up a few times in recent weeks is the question of vision. In more definite terms, what should we be doing as a company, in terms of standards and technology development?</p>
<p>There are no simple answers and a short-ish blog post will not provide detailed answers. I thought it might be interesting to write down a collection of high-level thoughts and see if people find them useful to fashion and guide their own thinking.</p>
<div id="attachment_364" class="wp-caption alignleft" style="width: 256px"><a href="http://www.assero.co.uk/wp-content/uploads/2011/06/Data-Point.jpg"><img class="size-full wp-image-364" title="Data Point" src="http://www.assero.co.uk/wp-content/uploads/2011/06/Data-Point.jpg" alt="" width="246" height="338" /></a><p class="wp-caption-text">Tracing the Data - FDA 2006</p></div>
<p>One goal I keep in my head is based on a presentation I did back in late 2006 to some folks at the FDA, as part of talking about the potential of using the CDISC Operational Data Model (ODM). In particular, I spoke about the “Chain of Custody”. The idea was not only being able to trace forward from the capture of the data through analysis and onwards to submission but also in the opposite direction. I painted a picture of an FDA reviewer being able to view data on screen, click on a data point and instantly see the entire history of that data point, its providence, be it a data point that was captured or some summary statistic or calculation. If it was a captured data point, we would want to see the CRF page, the audit trail of changes and then its flow through the analysis process and the final submission.</p>
<p>Why this scenario? Well I choose it because it encompasses an awful lot of capabilities. It requires electronic data for starters rather than paper, it requires standards at a number of levels such that the data can be understood, stored and transmitted. It requires high-quality data, it requires an audit trail, it requires data in known places that can, as I say, be understood, it requires metadata. It requires many capabilities we are working on, but it places them into context and paints a vision, a vision from the perspective of standards and technology but also from that of the regulator and protecting the public health.</p>
<p>From the pharmaceutical company’s perspective, there is a need for technology and standards to support that “better” process – and we need to define better, quicker, cheaper obviously come to mind but support the mainstream development and submission activities. Companies want to be able to:  understand the data, know all about it, know that the data is traceable and be in a position to answer any query about it from a solid position of knowledge, rather than one of semi-ignorance. It might also be good to be able to find the data when we need to! A company also needs to avoid “data decay”. By this I mean that we want to be able to understand the data in five years, 10 years or 15 years in the same way as today, whether or not the people who worked on it are still around or have moved on to pastures new. And a company would want to be able to be able to use the data: manipulate, aggregate, review and re-examine in ways that we may not have envisaged today.</p>
<p>As an aside from a company’s vision, I would say it is unfortunate that we do not have an industry-wide vision. It is, in my opinion, not a technical challenge as such though there are technology challenges. It is more a question of having the will to build that vision as a collaboration of regulators – note the plural there &#8211; and industry and others engaged in research. It requires a vision stating what is required rather than what should be built. But that is an aside.</p>
<p>Returning to a company vision, the high-level statements I presented above are the sort of statements that help fashion a vision. One advance I have never yet understood is the deployment of clinical data warehouses without the associated, “sufficient and appropriate” metadata. When I said, “use the data: manipulate, aggregate, review and re-examine in ways that we may not have envisaged today,” I cannot envisage that being possible without well-structured metadata. That then brings you to the role of the CDW, is it a simple repository of data such that the data is not lost, or is it the source of institutional knowledge?</p>
<p>A vision should be short and concise. It should guide you in what you need to do and be there to refer to when you are down in the weeds and forgetting why you are there in the first place. Acting without a vision is rather like trying to build a car without knowing what a car is or why you need it; in the end the resulting “product” won’t do the jobs you need it to do and, more likely, actually hinder you.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.assero.co.uk/2011/searching-for-an-eclinical-vision/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Educating the Ontologically Challenged &#8230;</title>
		<link>http://www.assero.co.uk/2011/educating-the-ontologically-challenged/</link>
		<comments>http://www.assero.co.uk/2011/educating-the-ontologically-challenged/#comments</comments>
		<pubDate>Thu, 26 May 2011 07:46:21 +0000</pubDate>
		<dc:creator>Dave IH</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[CDISC SHARE]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[Ontology]]></category>
		<category><![CDATA[semantic web]]></category>

		<guid isPermaLink="false">http://www.assero.co.uk/?p=342</guid>
		<description><![CDATA[&#8230; Or Making Dave Iberson-Hurst Ontologically Aware! Triples, RDF, Owl, Ontology, Linked Data, Semantic Web. I’ll stop because I think I have just exhausted my knowledge of the semantic web; I am not even sure that I have the right title for this Blog. What I do know is that the topic of the “semantic [...]]]></description>
			<content:encoded><![CDATA[<p><strong>&#8230; Or Making Dave Iberson-Hurst Ontologically Aware!</strong></p>
<p>Triples, RDF, Owl, Ontology, Linked Data, Semantic Web. I’ll stop because I think I have just exhausted my knowledge of the semantic web; I am not even sure that I have the right title for this Blog. What I do know is that the topic of the “semantic web” is important. As part of volunteering on the <a title="CDISC SHARE" href="http://www.cdisc.org/cdisc-share" target="_blank">CDISC SHARE</a> project and building <a title="Metadata" href="http://www.assero.co.uk/metadata/" target="_blank">the metadata repository</a>, we will need to be aware of what this could bring and build in the necessary hooks to do so, may be not for the first release, but certainly at some time in the future.</p>
<p>But the challenge is to understand it and get to grips with it. And so it is very fortunate that I have met someone who knows about these things and is prepared to try and make me “ontologically aware.” Kerstin Forsberg is someone I have met a few times now through various projects. <a title="Kerstin Forsberg - LinkedIn" href="http://www.linkedin.com/profile/view?id=3246450&amp;authType=name&amp;authToken=FYjG&amp;goback=.con" target="_blank">Kerstin works for AstraZeneca in Goteborg</a> and has considerable experience in the field and she has very kindly agreed to educate the dumb Englishman in this important field and we both felt that it would be useful to invite you along for the journey.</p>
<p>I like to keep things simple so I thought I would pose a few questions that Kerstin can try and answer and explain the various issues, technologies and other aspects. I will document the answers over a few blog posts and share them so that everyone can benefit.</p>
<p><strong>So the questions</strong></p>
<ol>
<li>What is the semantic web and linked data?</li>
<li>What is an ontology?</li>
<li>Where do RDF and OWL fit?</li>
<li>What could these things, if they are relevant , bring to a metadata repository?</li>
<li>What do they mean to clinical trial data, what is the benefit, what is in it for me?</li>
<li>What can other industries teach us?</li>
</ol>
<p>Finally, I would like to thank Kerstin for helping the ontologically challenged people like myself.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.assero.co.uk/2011/educating-the-ontologically-challenged/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

