DataONE:Notebook/ArticleCitationPractices

{| width="800"
 * style="background-color: #cdde95;" align="center"|
 * style="background-color: #cdde95;" align="center"|




 * align="center" style="background-color: #e5edc8;" |

title=Search this Project


 * colspan="2" style="background-color: #F2F2F2;" align="right"|Customize your entry pages 
 * colspan="2"|
 * colspan="2"|
 * colspan="2"|

Project Description

 * An investigation of if and how individual articles cite datasets.
 * Extraction of data from top tier journals with existing data citation policies or affiliation with existing data depositories.
 * This data will be used to examine the influence of author background, discipline, time, open access, journal data sharing policies and other factors on the rate of data reuse/sharing.
 * Complete listing of current extracted fields is housed on google docs. These are open to suggestion (which are greatly appreciated!) via comments on this page or the google doc. The extracted data for each article is currently housed on my personal computer and will be posted soon.
 * Part of a collaborative project with other DataONE interns, focusing on journal/repository metadata (Nic) and reuse of depository datasets (Valerie)
 * See the DataONE group Open Notebook for more details.]

Current Protocol

 * Full details here
 * Download journal articles from concurrent issues of selected journals.
 * Journals: American Naturalist, Systematic Biology, Molecular Ecology, Ecology, - still deciding on more....please leave me a note for further suggestions to cover the disciplinary spectrum of biology, specifically environmental and earth related journals. (currently considering: Evolution, Genetics, Paleobiology, something GIS/earth, Nature, Science)
 * Time: published in 2010
 * Data Extraction
 * Read through articles manually, with special attention to the Methods and Acknowledgements sections.
 * Record information about author, abstract, discipline, funding source, and most importantly, dataset citation (reuse and sharing).
 * Specific fields collected
 * Google Spreadsheet - Revised
 * [[Media:WorkingSpreadsheet_21July2010_DataCollectionComplete.xls]] (Note: This is the desktop version I work from on a daily basis and then reupload to google docs. It had intact formulas and hyperlinks, especially for populating fields from ISI export)
 * old versions: [[Media:WorkingSpreadsheet_16July2010.xls]], [[Media:WorkingSpreadsheet_2July2010.xls]], [[Media:WorkingSpreadsheet.xls]]
 * Google Spreadsheet - Field Explanations
 * Google Spreadsheet - Old
 * Collect Issue metadata: included Supplementary Data, Open Access, Article "type" (differs between journals)
 * Data coding
 * Most data is collected in full sentence or paragraphs of relevant information. This is then coded or used to calculated a standardize field.
 * This will (hopefully) progress to an automated database or coding technique.
 * Working on automating/expediting this entire process primarily through an Access database and integration with Zotero citation software. Please leave any suggestions on relevant software (text searching, coding, db) and methods.

Notebook
I am currently in the process of posting all my research notes here. Also, see Correspondence among the entire DataONE group for additional notes.


 * Plan
 * Article Post-Extraction Notes
 * Oddities To Resolve
 * Methods
 * Repositories
 * Database
 * Questions
 * Analysis
 * Manuscript
 * Original Research Questions


 * colspan="2" style="background-color: #F2F2F2;"|
 * colspan="2" style="background-color: #F2F2F2;"|


 * }