Thomson Data Analyzer
- With what operating system is Thomson Data Analyzer compatible?
- How fast a processor and how much memory do I need?
- How do I download Thomson Data Analyzer?
- Can I get a copy on CD-ROM?
- How do I install Thomson Data Analyzer?
- Can I install Thomson Data Analyzer on more than one computer?
- What text-mining tools are available?
- Can I use Thomson Data Analyzer with a non-Thomson Scientific data source?
- What is list cleanup?
- What does the thesaurus function do?
- Can I create my own thesaurus file?
- How can I combine two datasets?
- Can I delete records within the search set if I decide they are irrelevant for my analysis?
- What do the Thomson Data Analyzer scripts do?
- Can I change the reports or record my own macros?
- How many records can be analyzed?
- How does Thomson Data Analyzer work with Aureka?
- Can I trial the software?
- Can I apply my own classifications?
- What training is available on TDA?
- How can we set up internal and other commercial databases to work with TDA?
- Does my license include product updates?
- Can I distribute the results from TDA within my organisation?
Q: With what operating system is Thomson Data Analyzer compatible?
A: Thomson Data Analyzer runs on Windows 2000 and Windows XP. There are currently no plans to create a Mac OS or Linux version. You may need local administrator permissions for the macros to run. Some security settings will prevent one program (TDA) from opening another (Excel).
Q: How fast a processor and how much memory do I need?
A: We recommend at least a 500 MHz processor. Because Thomson Data Analyzer is processor-intensive, we recommend using as fast a processor as possible. At an absolute minimum, we suggest 128 MB of RAM. If you will be processing large files (1000+ records), we recommend at least 512 MB of RAM.
Q: How do I download Thomson Data Analyzer?
A: Once you have become a customer, you can download Thomson Data Analyzer from the secure Search Technology website at http://www.thevantagepoint.com/downloads.
Q: Can I get a copy on CD-ROM?
A: Yes - simply select this option when you order.
Q: How do I install Thomson Data Analyzer?
A: Simply double-click on the Thomson Data Analyzer '.msi' file that you obtained from the download site or CD-ROM to run the set-up program. The setup program will install Thomson Data Analyzer on your hard drive in the C:\Program Files\Thomson Data Analyzer directory. It will also create a program group in your Start menu and assign files with a .vpt extension to be opened by Thomson Data Analyzer.
Q: Can I install Thomson Data Analyzer on more than one computer?
A: To install Thomson Data Analyzer on more than one computer, you need to purchase additional licenses. The subscription contract prohibits installing the software on more than one computer per license.
Q: What text-mining tools are available?
A: Thomson Data Analyzer obtains words and phrases by text mining with Natural Language Processing (NLP). The NLP fields extract words or phrases from longer blocks of text (like titles and abstracts) and put them in a list. In Derwent records, the title and abstract phrases work well as faux keywords. These NLP fields can then be used like any other field: they can be cleaned, listed, used in matrices or mapped.
Q: Can I use Thomson Data Analyzer with a non-Thomson Scientific data source?
A: Yes. Thomson Data Analyzer allows new ‘import filters’ to be built for most structured data sources. Advanced users may wish to do this themselves, or Thomson can build and support new filters for you. (You may need to check the data licensing agreement with the information provider before using its data in an analytical tool.)
Q: What is list cleanup?
A: List cleanup is an automated process that uses fuzzy matching algorithms to match varieties of the same term. This can be limited to matching terms that only differ in case, punctuation or stem (plurals), or it can be expanded to match phrases that match most of their words (e.g. different divisions of a company).
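As a rough illustration of the two matching levels described above, here is a minimal Python sketch. It is not Thomson Data Analyzer's actual algorithm: the function names, the naive plural stripping and the 0.5 word-overlap threshold are all assumptions made for the example.

```python
import re
from collections import defaultdict

def normalize(term):
    """Build a key that ignores case, punctuation and a simple plural 's'."""
    words = re.sub(r"[^\w\s]", " ", term.lower()).split()
    # Naive stemming (assumption): drop a final 's' from longer words.
    stemmed = [
        w[:-1] if len(w) > 3 and w.endswith("s") and not w.endswith("ss") else w
        for w in words
    ]
    return " ".join(stemmed)

def cleanup(terms):
    """Strict level: group terms that differ only in case, punctuation or stem."""
    groups = defaultdict(list)
    for term in terms:
        groups[normalize(term)].append(term)
    return dict(groups)

def mostly_same(a, b, threshold=0.5):
    """Expanded level: treat phrases as variants when most words overlap."""
    words_a, words_b = set(normalize(a).split()), set(normalize(b).split())
    if not words_a or not words_b:
        return False
    overlap = len(words_a & words_b) / len(words_a | words_b)
    return overlap >= threshold

groups = cleanup(["Fuel Cells", "fuel cell", "Fuel-cell", "Battery"])
same_company = mostly_same("Acme Corp Research", "Acme Corp Europe")
```

Here `groups` collects the three fuel cell variants under one key, and `same_company` is true because two of the four distinct words are shared. A real cleanup would need a proper stemmer and a tuned threshold.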
Q: What does the thesaurus function do?
A: The thesaurus function acts as a “find and replace” on the fields. For example, you can create thesauri to replace a set of country abbreviations or classification codes with their full definitions. The use of “Regular Expression” for pattern matching allows great flexibility here, e.g. identifying patent authority and kind code to ascertain grant or application status. Thomson Data Analyzer comes with a variety of thesauri for converting IPC, Derwent Class, Derwent Manual Codes and common abbreviations.
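The regular-expression idea can be sketched in Python as an ordered list of pattern and replacement pairs. This is an illustration only, not the product's thesaurus file format. The names `STATUS_THESAURUS` and `apply_thesaurus` are hypothetical, and the kind-code rules are deliberately simplified (real kind-code conventions vary by authority and by date).

```python
import re

# Hypothetical thesaurus: ordered (pattern, replacement) pairs, first match wins.
# Simplified assumption: kind code A + digit = application, B + digit = grant.
STATUS_THESAURUS = [
    (re.compile(r"^(US|EP)\d+A\d$"), "Application"),
    (re.compile(r"^(US|EP)\d+B\d$"), "Grant"),
]

def apply_thesaurus(item, thesaurus):
    """Replace an item with the label of the first pattern that matches it."""
    for pattern, replacement in thesaurus:
        if pattern.match(item):
            return replacement
    return item  # items that match no pattern pass through unchanged

statuses = [
    apply_thesaurus(pn, STATUS_THESAURUS)
    for pn in ["EP1234567B1", "US20050123456A1", "JP2005123456A"]
]
```

With these rules, the first two publication numbers are replaced by "Grant" and "Application", while the Japanese number is left as it was because no pattern covers it.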
Q: Can I create my own thesaurus file?
A: Yes, there are several ways to create a thesaurus file. You can create them from groups, the thesaurus editor or the list cleanup function. You’ll find details on each method in the product’s Help.
Q: How can I combine two datasets?
A: You can combine two sets containing different records (e.g. an update to an existing dataset) by using “Data Fusion”. This works even for records from different types of data, so that similar fields can be aligned, e.g. Inventor field from Patents with Author field from Literature. You can also combine two sets containing different information about the same records (e.g. DWPI and PatentWeb) so that the fields combine into a ‘master’ record – this is called “Record Fusion”.
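The difference between the two kinds of fusion can be pictured with a small Python sketch. It assumes records are plain dictionaries and invents the function names, the field names and the field mapping for illustration. It does not reflect how Thomson Data Analyzer stores or merges data internally.

```python
def data_fusion(set_a, set_b, field_map):
    """Append different records into one set, renaming fields so similar ones align."""
    fused = []
    for source in (set_a, set_b):
        for record in source:
            fused.append({field_map.get(k, k): v for k, v in record.items()})
    return fused

def record_fusion(set_a, set_b, key):
    """Merge different information about the same records into 'master' records."""
    master = {r[key]: dict(r) for r in set_a}
    for r in set_b:
        merged = master.setdefault(r[key], {})
        for field, value in r.items():
            # Assumption: where both sources supply a field, keep set_a's value.
            merged.setdefault(field, value)
    return list(master.values())

patents = [{"Title": "Widget", "Inventor": "Smith J"}]
papers = [{"Title": "Widget study", "Author": "Jones A"}]
combined = data_fusion(patents, papers, {"Inventor": "Person", "Author": "Person"})

source_1 = [{"PN": "EP1", "Abstract": "a"}]
source_2 = [{"PN": "EP1", "Claims": "c"}]
masters = record_fusion(source_1, source_2, key="PN")
```

`combined` holds two records that share a `Person` field, while `masters` holds a single record carrying both the abstract and the claims.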
Q: Can I delete records within the search set if I decide they are irrelevant for my analysis?
A: You can mark records with "Omit from Dataset", and they will not be included in any new datasets you create. You cannot delete individual records from the dataset. However, you can create new datasets based on only those records that you are interested in by using the group function. Until you create a new dataset, the records you have marked will continue to be counted in any lists, matrices, maps or macros based on your original dataset.
Q: What do the Thomson Data Analyzer scripts do?
A: Scripts are included which will produce detailed Excel reports regarding:
- The portfolio of a single company (what they do, who works for them, who they work with, trends etc.)
- The relative position of up to 5 companies (areas of uniqueness and commonality)
- A technology area (key individuals and companies, sub-technologies, trends etc.)
There are also a variety of simple scripts to help save time with data cleaning, exporting and simple reporting.
Q: Can I change the reports or record my own macros?
A: The advanced reports are encrypted and cannot be modified. If you wish to create your own scripts, Thomson Data Analyzer uses the Visual Basic Scripting language (VBScript), and the script commands are documented in the "Automation & Scripts" section of the help file. Some of the simpler macros are not encrypted, so you can study them to learn how to write your own.
Q: How many records can be analyzed?
A: Thomson Data Analyzer has no upper limit, although the performance of your computer will produce a limit in practice. Normal operation on the full range of fields in patent and literature databases is comfortable with tens of thousands of records. Beyond that, you can use certain techniques to improve performance, such as being selective about the fields to import. Tests have shown that by restricting the dataset to only the key fields, analyses of 250,000 records can be performed on ‘everyday’ computers.
Q: How does Thomson Data Analyzer work with Aureka?
A: Data from Aureka can be imported, cleaned and analyzed within Thomson Data Analyzer. Cleaned data from Thomson Data Analyzer can be exported for use as a Corporate Document in Aureka (e.g. for ThemeScape maps). The two products are highly complementary and many customers use both.
Q: Can I trial the software?
A: Yes, you can try the software for one month free of charge – just contact your account manager or local Thomson Scientific office.
Q: Can I apply my own classifications?
A: Yes. Thomson Data Analyzer allows classification schemes to be uploaded, assigned to single or multiple records and analyzed.
Q: What training is available on TDA?
A: We provide a range of Webex and on-site training for all levels of user, tailored to each customer’s needs and workflow. The training covers not only how to use the product but also best practice in creating and presenting analyses.
Q: How can we set up internal and other commercial databases to work with TDA?
A: Advanced users may wish to use the powerful Import Engine Editor to create their own ‘import filters’. Alternatively, Thomson can work with customers in a confidential environment to build and support these import filters on their behalf. Each import filter contains rules for identifying individual records, fields and items.
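To make the three kinds of rule concrete (records, fields and items), here is a minimal Python sketch of a filter for a made-up tagged text export. The rule names, the file layout and the `parse` function are assumptions for illustration. They are not the Import Engine Editor's format.

```python
import re

# Hypothetical filter rules for an invented export format.
FILTER = {
    "record_separator": r"\n\s*\n",          # a blank line ends a record
    "field_pattern": r"^([A-Z]{2}): (.*)$",  # two-letter tag, colon, value
    "item_separator": ";",                   # several items within one field
}

def parse(text, rules):
    """Split raw text into records, each mapping a field tag to a list of items."""
    records = []
    for block in re.split(rules["record_separator"], text.strip()):
        record = {}
        fields = re.findall(rules["field_pattern"], block, flags=re.MULTILINE)
        for tag, value in fields:
            items = value.split(rules["item_separator"])
            record[tag] = [item.strip() for item in items if item.strip()]
        records.append(record)
    return records

sample = "TI: Widget\nAU: Smith J; Jones A\n\nTI: Gadget\nAU: Lee K\n"
records = parse(sample, FILTER)
```

`records` then contains two entries, and the `AU` field of the first is split into two separate author items.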
Q: Does my license include product updates?
A: Yes it does. Upgrades include new reports (automated scripts), thesauri and import filters for common databases, as well as updates to the core software itself.
Q: Can I distribute the results from TDA within my organisation?
A: Yes. Many decision-makers within an organization prefer reports in a familiar format, and TDA is designed throughout to communicate with MS Office products. As well as the detailed reports in Excel, normal use of the Windows clipboard is supported throughout. We also offer the Thomson Data Analyzer Reader edition for customers with multi-user licenses. This allows others to review analyses produced in the full version of Thomson Data Analyzer and add classifications to records.