Importing external data
You can import external citation data into Publish or Perish from the following sources and formats:
Source | Supported formats |
---|---|
Publish or Perish | All export formats except BibTeX |
Google Scholar/Citations | CSV, EndNote, RefMan/RIS |
Scopus | Comma-separated file (Excel et al.) , RefMan/RIS |
Web of Science | All export formats except Mac-based |
Various sources | EndNote Save All Fields, EndNote tagged, RefMan/RIS |
Below are the details of each format as Publish or Perish implements them. This is for reference only; in general you do not need to know these details if you are importing the data from other software that produces at least one of these formats.
Note: To import data that you have previously archived through the Export to Archive command, see Archiving your data.
How to import external data
You can import externally produced citation data as follows.
- Go to the Multi-query center.
- Select the query folder in which to import the external data.
- Do one of the following:
- Choose File > Import External Data... from the main menu, or
- Right-click on the folder, then choose New Import..., or
- Click on the New Import toolbar button, or
- Press Ctrl+O
- In the Open dialog box that appears, browse for and select the external data file(s) to import, then click on the Open button. Note that you can select and import multiple files at once; each file will appear as a separate data import in Publish or Perish.
- Confirm each import in the External Data Properties dialog box. If you cancel the dialog box, the data will not be added to Publish or Perish.
Publish or Perish data formats
Publish or Perish can re-import all data produced by itself, except the BiBTeX export format. In particular, the following Publish or Perish data formats are supported.
Format name | Field encoding | Notes |
---|---|---|
BiBTeX | Record-oriented, free layout | Not supported |
CSV export | Comma-separated, one line per record | Data as exported by Publish or Perish in CSV format |
CSV results | Comma-separated, one line per record | Results as stored in the %APPDATA%\Publish or Perish\Results3 folder |
EndNote export | Tagged, multiple lines per record | Data as exported by Publish or Perish in EndNote format |
ISI/WoS export | Tagged, multiple lines per record | Data as exported by Publish or Perish in ISI/Web of Science format |
RefMan/RIS export | Tagged, multiple lines per record | Data as exported by Publish or Perish in RefMan/RIS format. Both old-style and new-style (Publish or Perish 4.0.1+) tags are supported. |
Google Scholar/Citations data formats
Apart from querying Google Scholar directly, Publish or Perish can also import data exported from Google Scholar and Google Citations. The following table shows which Google export formats are, or are not, supported by Publish or Perish.
Format name | Field encoding | Notes |
---|---|---|
BiBTeX | Record-oriented, free layout | Not supported |
CSV | Comma-separated, one line per record | Supported; see below |
EndNote | Tagged, multiple lines per record | Supported; see below |
RIS format | Tagged, multiple lines per record | Supported; see below |
Google CSV format
From the Google CSV format, Publish or Perish imports the following fields.
Field name | Imported to | Notes |
---|---|---|
Authors | Author names | Author names are reformatted according to Google Scholar/Publish or Perish conventions |
Title | Title of the entry | |
Publication | Title of the journal | |
Year | Year of publication | |
Publisher | Publisher name |
Scopus data formats
The following table shows which Scopus export formats are, or are not, supported by Publish or Perish.
Format name | Field encoding | Notes |
---|---|---|
CSV | Comma-separated, one line per record | Supported; see below |
RIS format | Tagged, multiple lines per record | Supported; see below |
RefWorks direct export | Free form, semi-structured | Not supported |
Text | Free form, semi-structured | Not supported |
Scopus CSV format
From the Scopus CSV format, Publish or Perish imports the following fields.
Note: Scopus produces two versions of its CSV export format: Complete and Cites only.
- In the former, author lists are formatted as LastName Inits, LastName Inits,... that is, with commas between successive authors.
- The latter uses a variation of this format and exports LastName, Inits, LastName, Inits,... with commas between authors, but also between each last name and the corresponding initials.
This makes parsing the authors list ambiguous and is probably a bug in Scopus' Cites only export algorithm. To work around this bug, Publish or Perish applies several heuristics to determine the most probable interpretation of the authors' names and reformats the names accordingly when the data are imported.
Field name | Imported to | Notes |
---|---|---|
Authors | Author names | Author names are reformatted according to Google Scholar/Publish or Perish conventions |
Cited by | Number of citations | |
Document Type | Publication type | From Publish or Perish 4.0.7 onward |
Link | Article URL | |
Publisher | Publisher name | |
Source title | Title of the journal | |
Title | Title of the entry | |
Year | Year of publication |
Web of Science data formats
The following table shows which Web of Science (formerly known as ISI) export formats are, or are not, supported by Publish or Perish.
Format name | Field encoding | Text encoding | Line ending | Notes |
---|---|---|---|---|
EndNote | Tagged, multiple lines per record | ANSI | Unix (\n) | Not recommended (may cause problems with non-English names and text) |
Other Reference Software | Tagged, multiple lines per record | Unicode UTF-8, BOM | Unix (\n) | Recommended format |
Plain text | Tagged, multiple lines per record | Unicode UTF-8, BOM | Unix (\n) | Appears to be identical to Other Reference Software format |
Tab-delimited (Mac) | Tab-separated, one line per record | Unicode UTF-16, BOM | Mac (\r) | Not supported |
Tab-delimited (Mac, UTF-8) | Tab-separated, one line per record | Unicode UTF-8, BOM | Mac (\r) | Not supported |
Tab-delimited (Win) | Tab-separated, one line per record | Unicode UTF-16, BOM | DOS (\r\n) | Supported |
Tab-delimited (Win, UTF-8) | Tab-separated, one line per record | Unicode UTF-8, BOM | DOS (\r\n) | Supported |
Web of Science EndNote format
The EndNote format as produced by Web of Science uses the same tags as its Other Reference Software format, but encodes its text in the current code page. This is not recommended. We recommend that you use the Other Reference Software format instead.
Web of Science Other Reference (and Plain text) format
From the Web of Science Other Reference Software (or Plain text) format, Publish or Perish imports the following tags.
Tag name | Imported to | Notes |
---|---|---|
AU | Author names | Multiple lines (one for each author) are combined into a single authors list and reformatted according to Google Scholar/Publish or Perish conventions |
PT | Publication type | From Publish or Perish 4.0.7 onward |
DT | Document type | If available, replaces PT (publication type) from Publish or Perish 4.19.0 onward |
PU | Publisher name | |
PY | Year of publication | |
SO | Title of the journal | |
TC | Citation count | |
TI | Title of the entry | |
UR | Article URL |
Web of Science Tab-delimited format
From the Web of Science Tab-delimited format, Publish or Perish imports the following fields.
Field name | Imported to | Notes |
---|---|---|
AU | Author names | Author names are reformatted according to Google Scholar/Publish or Perish conventions |
PT | Publication type | From Publish or Perish 4.0.7 onward |
DT | Document type | If available, replaces PT (publication type) from Publish or Perish 4.19.0 onward |
PU | Publisher name | |
PY | Year of publication | |
SO | Title of the journal | |
TC | Citation count | |
TI | Title of the entry |
EndNote Save All Fields format
From the EndNote Save All Fields format, Publish or Perish imports the following fields.
Field name | Imported to | Notes |
---|---|---|
Author | Author names | Author names are reformatted according to Google Scholar/Publish or Perish conventions |
Book title | Title of the journal | |
Journal | Title of the journal | |
Publisher | Publisher name | |
Reference Type | Publication type | From Publish or Perish 4.0.7 onward |
Title | Title of the entry | |
Year | Year of publication |
EndNote Tagged format
From the EndNote Tagged format, Publish or Perish imports the following tags.
Tag name | Imported to | Notes |
---|---|---|
%0 | Publication type | From Publish or Perish 4.0.7 onward |
%1 | Citation count, citations URL, query date | Produced by Publish or Perish's own exports |
%A | Author names | Author names are reformatted according to Google Scholar/Publish or Perish conventions |
%B | Title of the journal | |
%D | Year of publication | |
%I | Publisher name | |
%J | Title of the journal | |
%T | Title of the entry | |
%U | Article URL |
RefMan/RIS format
From the RefMan/RIS format, Publish or Perish imports the following tags.
Tag name | Imported to | Notes |
---|---|---|
A1, AU | Author names | Multiple lines (one for each author) are combined into a single authors list and reformatted according to Google Scholar/Publish or Perish conventions |
JF, JO | Title of the journal | Alternates for T2 tag |
M1 | Citation count, citations URL, query date | Produced by Publish or Perish's own exports (as multiple M1 lines) |
N1 | Citation count, query date | Produced by Scopus exports (as multiple N1 lines) |
PB | Publisher name | |
PY | Year of publication | Alternate for Y1 tag |
T1, TI | Title of the entry | |
T2 | Title of the journal | Alternate for JF and JO tags |
TY | Publication type | From Publish or Perish 4.0.7 onward |
UR | Article URL | |
Y1 | Year of publication | Alternate for PY tag |