Tech Mining for Emerging STI Trends Through Dynamic Term Clustering and Semantic Analysis: The Case of Photonics
Technology mining (TM) helps to acquire intelligence about the evolution of research and development (R&D), technologies, products, and markets for various STI areas and what is likely to emerge in the future by identifying trends. The present chapter introduces a methodology for the identification of trends through a combination of “thematic clustering” based on the co-occurrence of terms, and “dynamic term clustering” based on the correlation of their dynamics across time. In this way, it is possible to identify and distinguish four patterns in the evolution of terms, which eventually lead to (i) weak signals of future trends, as well as (ii) emerging, (iii) maturing, and (iv) declining trends. Key trends identified are then further analyzed by looking at the semantic connections between terms identified through TM. This helps to understand the context and further features of the trend. The proposed approach is demonstrated in the field photonics as an emerging technology with a number of potential application areas.
Technology foresight has been increasingly undertaken by developing countries to identify technologies whose adoption might serve as a platform for future economic growth. However, foresight activities have not, by and large, resulted in well-developed policy initiatives. Three factors are relevant for improvement. First, foresight activities would benefit from being more informed by the convergence literature and global convergence experience over the past several decades, and should therefore incorporate organically the concepts of absorptive capacity and technology gap into foresight exercises. Second, certain preconditions – in particular the existence of a functional national innovation system – enhance the likelihood that foresight exercises will be successful. Third, in order to achieve wide buy-in and promote the sustainability of initiatives generated by the foresight activity, developing countries are advised to consult widely in the foresight process. Policies emanating from foresight activities should additionally address two core challenges: a) a clear definition of those technologies that should be developed internally vs. those that should be sourced from abroad and b) identification of the internal capabilities to be developed in conjunction with those technologies targeted for acquisition from abroad.
This article represents a new technique for collaborative filtering based on pre-clustering of website usage data. The key idea involves using clustering methods to define groups of different users.
This is a textbook in data analysis. Its contents are heavily influenced by the idea that data analysis should help in enhancing and augmenting knowledge of the domain as represented by the concepts and statements of relation between them. According to this view, two main pathways for data analysis are summarization, for developing and augmenting concepts, and correlation, for enhancing and establishing relations. Visualization, in this context, is a way of presenting results in a cognitively comfortable way. The term summarization is understood quite broadly here to embrace not only simple summaries like totals and means, but also more complex summaries such as the principal components of a set of features or cluster structures in a set of entities.
The material presented in this perspective makes a unique mix of subjects from the fields of statistical data analysis, data mining, and computational intelligence, which follow different systems of presentation.
Science, technology and innovation (STI) involves numerous policy fields which are championed by different government ministries or agencies. A consistent and coherent anticipatory policy mix is understood to be one that ensures a timely development and implementation of various forward-looking policy instruments. Such timely implementation is crucial for the eventual impact of the policy measures. This also requires that foresight for STI policies looks beyond the potential development paths and challenges but includes the time dimension and the outline of necessary policy responses including a relevant implementation framework. In addition the institutions which are part of the National Innovation Systems (NIS) should to be considered thoroughly for a well-balanced and comprehensive policy mix. Not only national but also regional and local actors need to be involved—and they need to be involved not only in the implementation of policy but at much earlier stages in the foresight and subsequent design procedures of the policy mix. One practical approach for convincing and engaging NIS actors at different levels is to stress opportunities which offer advantages to each of them, instead of just focusing on challenges and problems.
Our research aims at automatic identification of constructions associated with particular lexical items and its subsequent use in building the catalogue of Russian lexical constructions. The study is based on the data extracted from the Russian National Corpus (RNC, http://ruscorpora.ru). The main accent is made on extensive use of morphological and lexico-semantic data drawn from the multi-level corpus annotation. Lexical constructions are regarded as the most frequent combinations of a target word and corpus tags which regularly occur within a certain left and/or right context and mark a given meaning of a target word. We focus on nominal constructions with target lexemes that refer to speech acts, emotions, and instruments. The toolkit that processes corpus samples and learns up the constructions is described. We provide analysis for the structure and content of extracted constructions (e.g. r:ord der:num t:ord r:qual|pervyj ‘first’ + LJUBOV’ ‘love’; LJUBOV’ ‘love’ + PR|s ‘from’ + ANUM m sg gen|pervyj ‘first’ + S f inan sg gen|vzgljad ‘sight’ = love at first sight). As regards their structure, constructions may be considered as n-grams (n is 2 to 5). The representation of constructions is bipartite as they may combine either morphological and lemma tags or lexical-semantic and lemma tags. We discuss the use of visualization module PATTERN.GRAPH that represents the inner structure of extracted constructions.
A vast amount of documents in the Web have duplicates, which is a challenge for developing efficient methods that would compute clusters of similar documents. In this paper we use an approach based on computing (closed) sets of attributes having large support (large extent) as clusters of similar documents. The method is tested in a series of computer experiments on large public collections of web documents and compared to other established methods and software, such as biclustering, on same datasets. Practical efficiency of different algorithms for computing frequent closed sets of attributes is compared.
This book presents research dedicated to solving scientific and technological problems in many areas of electronics, photonics and renewable energy. Progress in information and renewable energy technologies requires miniaturization of devices and reduction of costs, energy and material consumption. The latest generation of electronic devices is now approaching nanometer scale dimensions; new materials are being introduced into electronics manufacturing at an unprecedented rate; and alternative technologies to mainstream CMOS are evolving. The low cost of natural energy sources have created economic barriers to the development of alternative and more efficient solar energy systems, fuel cells and batteries.
Nanotechnology is widely accepted as a source of potential solutions in securing future progress for information and energy technologies. Nanoscale Materials and Devices for Electronics, Photonics and Solar Energy features chapters that cover the following areas: atomic scale materials design, bio- and molecular electronics, high frequency electronics, fabrication of nanodevices, magnetic materials and spintronics, materials and processes for integrated and subwave optoelectronics, nanoCMOS, new materials for FETs and other devices, nanoelectronics system architecture, nano optics and lasers, non-silicon materials and devices, chemical and biosensors,quantum effects in devices, nano science and technology applications in the development of novel solar energy devices, and fuel cells and batteries.
Foresight has gained much attention as a tool for developing and informing science, technology and innovation policy and company strategies. It is frequently used for detecting not only potential development paths of technologies but also possible economic and societal changes; and for identifying challenges that nations, societies and companies might face in the future. Raising awareness within the respective communities of trends and challenges is critically important—and the biggest challenge is how we can develop measures to meet these anticipated challenges. Paradoxically, perhaps, it may be more helpful for creating and implementing successful measures if these are elaborated by thinking about grasping opportunities, rather than framing them in terms of threats that have to be responded to. Accordingly there is a need to change the mindsets in science, technology and innovation policy making—and to engender solution and opportunity orientation among scientists and engineers.
Tech Mining, a special form of “Big Data” analytics, aims to generate Competitive Technical Intelligence (CTI) using bibliometric and text-mining software (e.g., VantagePoint, TDA) as well as other analytical & visualization applications for analyses of Science, Technology & Innovation (ST&I) information resources. The goal of the conference is to ENGAGE cross-disciplinary networks of analysts, software specialists, researchers, policymakers, and managers toADVANCE the use of textual information in multiple science, technology, and business development fields. The conference program will address key CHALLENGES in:
DataSourcing, preparing, and interpreting data sources including patents, publications, webscraping, and other novel data sources
Text-mining tools and methodsBest practices in software-based topic modeling, clumping, association rules, term manipulation, text manipulation, etc. Visualization
Applied researchFuture-Oriented Technology Analysis (FTA) Intelligence gathering to support decision-making in the private sector (e.g., Management of Technology)
This conference is intended for researchers and students across multiple fields, especially Scientometrics, Public Policy, Management of Technology and Information Science.