A Study of Knowledge Integration in Digital Humanities Based on Bibliographic Analysis Tang Muh-Chyun Dept. of Librarya and Information Science, National Taiwan University, Taiwan, Republic of China muhchyun.tang@gmail.com Cheng Yun Jen Dept. of Librarya and Information Science, National Taiwan University, Taiwan, Republic of China yjcheng0314@gmail.com Chen Kuang Hua Dept. of Librarya and Information Science, National Taiwan University, Taiwan, Republic of China khchen@ntu.edu.tw Hsiang Jieh Dept. of Computer Science and Information Engineering National Taiwan University jieh.hsiang@gmail.com 2016-03-07T09:35:00Z Maciej Eder, Pedagogical University in Krakow Jan Rybicki, Jagiellonian University
Institute of Polish Studies Pedagogical University ul. Podchorazych 2 30-084 Krakow, Poland maciej.eder@ijp-pan.krakow.pl

Converted from a Word document

Paper Short Paper network analysis cohesion digital humanities bibliographic analysis bibliographic methods / textual studies interdisciplinary collaboration visualisation networks, relationships, graphs English

This is an extension of our previous work in which a longitudinal (1989-2014) co-citation analysis of literature in Digital Humanities was (DH) conducted to explore the degree of interdisciplinarity in DH. The selection of the literature was based on a combination of keyword search with Scopus and articles published in the key journals in the field (i.e. journals published by members of ADHO). A dual trend of gradual increases in both topical diversity and network cohesion, two hallmarks of interdisciplinarity (Porter et.al., 2007; Rafols and Meyer 2010), were found in the co-citation network. For example, as shown in Figure 1, the average path length remained steady relative to the gradual increase of the network diameter, which suggests continuing growth of the literature and its gradual consolidation (the sharp rise at the tail being the artifact of citation window). On the other hand, the growing diversity was demonstrated by the steady increase of distinct author-assigned keywords over time.

Figure 1. Growing of co-citation network and average path length

In this article, the results of further analyses were reported that aimed to, firstly, further explore of the issue of disciplinary cohesion using co-authorship and bibliographic coupling networks; and secondly, to identify cohesive subgroups or specialties in the DH literature.

Disciplinary cohesion of DH

Based on the assumption that the knowledge integration process in research communities depends heavily on the topology of the underlying social network, Moody used the structure of collaborative (i.e. co-authorship) network to represent the integration of knowledge in Sociology over time (Moody, 2004). The concept of “structure cohesion” or “connectivity” in network analysis was used to measure the degree of social cohesion in Sociology, which is defined as “the extent to which a network will remain connected when nodes are removed from the network (Moody and White, 2003).” Moody (2004) discussed three types of network structures: star production, small world, and structurally cohesive and surmised on the corresponding collaborative practices each represents. Moody (2004) believe that a cohesive collaboration network signals the presence of “permeable theoretical boundaries and generic methods” that allows scholars specialized in particular theoretical, empirical or theoretical skills to collaborate freely. He added that, if enough scholars engage in this kind of cross-fertilization, mixing across multiple areas, there will be few clear divisions presented in the collaborative network (Moody, 2004). Similarly, Carolan (2008) used network structure of articles published by a leading journal in Education to examine how well the heterogeneous set of ideas and practices were integrated within the discipline.

As shown in Figure 2, contrary to the co-citation network, the DH co-authorship network is very sparse and highly fragmented. The percentage of the nodes in the main component hovered only below 20% even after discounting the isolates, which is extremely low compared to other disciplines or research areas (See Table 1). Notice the contrast is especially striking with sciences, medicine, and IT. The low percentage of nodes in main component (Figure 2), coupled with extremely high clustering coefficient and modularity (Figure 3), indicated most collaboration took place at the local level, lacking global “shortcuts” found in the small-world model to hold the network together (See Figure 4).

Figure 2. Percentage of nodes in the main component in the co-authorship network
Figure 3. Trends of clustering coefficient and modularity in co-authorship
Figure 4. Part of the co-authorship network
Figure 5. Three largest components in the co-authorship network

A closer examination of the co-authorship network shows that, beyond the two largest components, there were relatively few international collaborations (See Figure 5). The largest component was composed of scholars mainly from the U.S. (28.53), Canada (27.12%), U.K. (26.84%), and Germany (10.45%); the second largest component was composed of scholars from the U.S. (39.71%), the Netherlands (30.88%), and Japan (5.88%); and the third component was composed of all Italian scholars.

The identification of specialties in DH

Modularity maximization graph partition was applied to both the co-citation and bibliographic coupling networks to identify subgroups or specialties in DH. Both co-citation and bibliographic coupling have been widely used to establish similarity or linkages between documents in bibliometrics (See, for example, Yan and Ding, 2012; Boyack and Klavans, 2010).

The co-citation network was construed by using Google Scholar’s citation tracing function. The citations received by every article in our target set were first identified and downloaded then pair-wise matching was performed to identify shared citations. The bibliographic coupling was generated by pair-wise comparison of cited references retrieved from Scopus. A threshold of shared 3 citations in the reference lists was set to dichotomize the network. Figure 5 and 6 show the results of the modularity-based partition resulted from co-citation and bibliographic coupling networks, respectively. Due to the lack of global cohesion, only the giant component in either network was analyzed. We are currently in the process of identifying the research topics represented by the clusters in either network, which will be done by examining the author-assigned keywords and authors appearing in each cluster. The labeling of the clusters by each’s prominent authors has been shown to be an effective way of visualizing a field (e.g. White and McCain, 1998). Interviews with experts who have broad knowledge in the field of DH will be done in order to help us interpret the meaning of the clusters. Efforts will also be made to explore the reasons behind the lack of cohesion in the DH co-authorship network.

Figure 6. Modularity analysis of co-citation network
Figure 7. Modularity analysis of bibliographic coupling network
Bibliography Boyack, K.W., Klavans, R. (2010). Co‐citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately? Journal of the American Society for Information Science and Technology, 61(12): 2389-404. Carolan, B.V. (2008). The structure of educational research: The Role of Multivocality in Promoting Cohesion in an Article Interlock Network. Social Networks, 30(1): 69-82. Moody, J. (2004). The structure of a social science collaboration network: Disciplinary cohesion from 1963 to 1999. American sociological review, 69(2): 213-38. Moody, J. and White, D.R. (2003). Structural cohesion and embeddedness: A hierarchical concept of social groups. American Sociological Review, pp. 103-27. Porter, A.L., Cohen, A.S., Roessner, J.D. and Perreault, M. (2007). Measuring researcher interdisciplinarity. Scientometrics, 72(1): 117-47. Rafols, I. and Meyer, M. (2010). Diversity and network coherence as indicators of interdisciplinarity: case studies in bionanoscience, 82(2): 263-87. White, H.D. and McCain, K.W. (1998). Visualizing a discipline: An author co-citation analysis of information science, 1972-1995. Journal of the American society for information science, 49(4): 327-55. Yan, E. and Ding, Y. (2012). Scholarly network similarities: How bibliographic coupling networks, citation networks, cocitation networks, topical networks, coauthorship networks, and coword networks relate to each other. Journal of the American Society for Information Science and Technology, 63(7): 1313-26.