DataONE:Notebook/Reuse of repository data/2010/06/22
Reuse of Repository Data | Main project page Previous entry Next entry |
Notes for June 22, 2010
Abstract Brainstorming
Publication Brainstorming
Transcript of Conversation with Heather[16:51] Heather Piwowar: Hi Valerie! [16:51] Valerie Enriquez: Hi [16:51] Valerie Enriquez: what's going on? [16:51] Heather Piwowar: Had a thought about next steps for your research.... [16:51] Heather Piwowar: left a blurb on your talk page [16:51] Valerie Enriquez: yeah, I could use a bit of pointing in the analysis department [16:51] Valerie Enriquez: thanks [16:51] Heather Piwowar: want to chat to flush it out and see what you think? [16:52] Heather Piwowar: I've got 10 minutes now, or we could chat tomorrow morning early? [16:52] Valerie Enriquez: (if I randomly sign off, that means my computer has once more overheated despite it not being that ot here) [16:52] Valerie Enriquez: sure [16:52] Valerie Enriquez: I could talk now or tomorrow [16:52] Heather Piwowar: no prob. my wireless randomly drops, I understand [16:52] Heather Piwowar: ok let's do a bit now and then more tomorrow [16:52] Valerie Enriquez: ok, sounds good [16:52] Valerie Enriquez: right before the meeting tomorrow maybe? [16:53] Heather Piwowar: oh, hrm, do we have a meeting tomorrow? [16:53] Valerie Enriquez: oh wait [16:53] Heather Piwowar: we did last week that is true [16:53] Heather Piwowar: I wasn't thinking we would tomorrow though. [16:53] Valerie Enriquez: did we agree that we're not having a meeting? [16:53] Valerie Enriquez: oh yeah, sorry [16:54] Heather Piwowar: I'll send an email to confirm: no meeting. [16:54] Valerie Enriquez: either way, a bit now, a bit tomorrow [16:54] Valerie Enriquez: cool [16:54] Heather Piwowar: ok, here's the blurb: [16:54] Heather Piwowar: Valerie, just talked to Todd and we had a thought. It seems like generating effective searches for reuse is really difficult. What makes a search effective varies by repository due to repository names, support for dois, etc. Evidence of the difficulties would be very useful/motivating for initiatives like datacite, and interesting to all people who submit data. As such, describing the difficulties in formulating effective searches for reuse, using three repositories as examples, would make a great publication in and of itself. Maybe a research article, or a perspectives piece, or ??? And insights from the write-up could inform how to proceed for the last few weeks of your internship. See what you think, consider some potential publishing venues that might be appropriate for a case study like this, and let's chat sometime on Wednesday about it? [16:54] Valerie Enriquez: I like that idea [16:54] Heather Piwowar: cool. [16:54] Valerie Enriquez: mainly because what I've found has been spotty [16:54] Heather Piwowar: want to say it back to me to make sure we are thinking the same thing? [16:54] Heather Piwowar: :) [16:55] Valerie Enriquez: ok, basically, a writeup about the various search methods I've used and a commentary on how difficult it is to find citations for reused data [16:55] Heather Piwowar: yup. that. [16:55] Heather Piwowar: not sure how quantitative. [16:55] Valerie Enriquez: that sounds more up to my speed [16:55] Heather Piwowar: probably at least a little bit quantitaive [16:55] Valerie Enriquez: ah, yeah, I had questions about how I'd measure that [16:55] Heather Piwowar: along the lines of what you've found [16:55] Valerie Enriquez: would I use R? [16:55] Heather Piwowar: nah, not necessarily. [16:56] Valerie Enriquez: ok [16:56] Heather Piwowar: so I'd imagine that as a perspective piece it would mostly be case study approach.... [16:56] Heather Piwowar: so saying "if I do this search here, 60% of my hits are for data creation" [16:56] Heather Piwowar: etc [16:56] Valerie Enriquez: ah [16:57] Heather Piwowar: whereas if I try that search that worked with treebase for pangaea, all I get it hits for supercontinents. [16:57] Valerie Enriquez: my main worry was that my searches turned out to bee too specific to provide a useful sample size of articles [16:57] Valerie Enriquez: ah, yeah [16:57] Heather Piwowar: or "dois work well for xxxx, but aren;t supprted in searching for reuse out of repositories A, B, C" [16:57] Heather Piwowar: yeah, agreed. [16:58] Valerie Enriquez: I have 75 articles so far [16:58] Heather Piwowar: so instead if we decide the weakness is a strength, and figure out how to tell the story of how difficult it is, and what gotchas you run into... [16:58] Valerie Enriquez: cool [16:58] Heather Piwowar: yeah, not many. [16:58] Heather Piwowar: so maybe we make the story about the searching, instead. [16:59] Heather Piwowar: with some numbers, to quantify the relative difficulty of getting hits, but mostly not. [16:59] Heather Piwowar: is the scope of journals that you read, is there a natural home for this sort of article? [17:00] Valerie Enriquez: well, the ISI searches went across a whole bunch of different journals [17:00] Valerie Enriquez: I'm going through again to check whether or not they're open source [17:00] Heather Piwowar: .... hmm, let me take back the previous "mostly not." The article should be quantitative enough to make its point and provide useful evidence. [17:00] Valerie Enriquez: ok [17:01] Valerie Enriquez: like mentioning that one search brought up 1000 results [17:01] Heather Piwowar: can you think of places you'd want to publish an article like this? [17:01] Heather Piwowar: (since thinking about the first-choice submission place always helps to focus the tone/length/etc) [17:02] Valerie Enriquez: hm. it might be useful in a library publication relating to reference (particularly scientific reference), but I'd have to look further into that [17:02] Valerie Enriquez: ok [17:02] Heather Piwowar: yes. I'll brainstorm too. [17:02] Heather Piwowar: maybe put a place on the wiki where you are collecting ideas, and I'll add to it and then others can too. [17:02] Valerie Enriquez: ok, I could put a comment box on the main page [17:03] Valerie Enriquez: (er, the main page of my notebook) [17:03] Heather Piwowar: so the idea would be that you'd maybe wrap up exploring search alternatives (especially DOIs) for DAAC [17:03] Heather Piwowar: then outline and write this piece, roughtly, maybe in time for the July meeting. [17:03] Heather Piwowar: what do you think, woudl that be doable? [17:03] Valerie Enriquez: Definitely [17:03] Heather Piwowar: then discussion about that woudl lead to where to focus for the last few weeks of your internship. [17:04] Valerie Enriquez: excellent [17:04] Heather Piwowar: on the 75 articles, or for example on following the reuse patterns of a few select datasets, or ??? [17:04] Heather Piwowar: cool. [17:04] Heather Piwowar: well I don't know your usual manuscript-drafting approach, [17:04] Heather Piwowar: but if you usually write an abstract first, or outline first, or something [17:05] Valerie Enriquez: I've been brainstorming abstract ideas in my past couple of journal entries [17:05] Heather Piwowar: then we can give you early feedback. [17:05] Valerie Enriquez: but thanks to todays meeting, I have more of a direction now [17:05] Heather Piwowar: yes, I saw that. good stuff. [17:05] Heather Piwowar: good. [17:06] Heather Piwowar: ok, super. then I'll keep an eye on your pages, and will give a shout-out to Todd and others when you have a direction gelling... [17:06] Heather Piwowar: jelling? I have no idea. [17:06] Heather Piwowar: anyway. [17:06] Valerie Enriquez: excellent, thanks! [17:06] Heather Piwowar: good. [17:07] Heather Piwowar: let me know if/when you get stuck or want someone to bounce ideas off of for direction/focus/venue/whatever. [17:07] Valerie Enriquez: ok, I will [17:07] Heather Piwowar: I'll be online a bit less at the end of the week, but still available even if via email. [17:07] Heather Piwowar: bye! [17:07] Valerie Enriquez: bye! |