Geovisualization: A Newer GIS Technology for Implementation Research in Health
1. Introduction
Complex, multidimensional data are examined to assemble meaningful information that improves the planning, implementation and monitoring of programs and supports evidence-based policymaking for better population health [1]. Program managers and researchers use various advanced methodologies to examine data for specific purposes. As public health datasets such as the Health Management Information System (HMIS) and larger health surveys such as the Demographic and Health Surveys have become increasingly complex, there is a growing need for innovative data analysis methods and data representation tools for optimal utilization of information [2] [3]. A number of quantitative approaches have been used to address the complex analysis of these datasets [4] [5], but critical analysis and visualization methods for spatial data are limited in public health research.
Recently, Geographical Information Systems (GIS) have emerged as an innovative and important component of research and practice in public health [6]. GIS has proved useful for various research purposes, including epidemiological surveys and investigations, implementation research, program and policy decision making, and dissemination of information. Despite the advancement of computational and GIS technologies [7], there is an urgent need for novel approaches to represent geographic data in a visual form that can improve recognition of patterns and trends and support hypothesis generation [8]. More importantly, such approaches should allow for better understanding and wider usage by non-GIS experts working in public health. Visual representations can often communicate information effectively and help decision makers prioritize the actions required to improve public health outcomes [9]. Maps are an efficient means for communication, analysis, synthesis and exploration of geographic data and information [10]. Traditional GIS visualization techniques focus on the presentation of points, lines and polygons in static maps, such as paper-based maps. Newer techniques such as web-based geovisualization allow users to explore specific phenomena in more efficient and effective ways to uncover or clarify dimensions that may be ambiguous in traditional GIS visualizations [11].
Geovisualization is increasingly being used to inform public health research, planning and decision making in developed countries [8]. It aids investigation of the etiology and distribution of diseases, optimal deployment of limited resources and adoption of policies and regulations [12]. However, its best application in public health would be spatial analysis that gives a comprehensive, easy-to-understand view of complex datasets [13]. Geovisualization is often underutilized because it is a relatively new GIS technique, training is lacking and landmark examples in public health are few. Underutilization is even greater in developing countries with weak health systems and a higher burden of disease, which most need such tools to inform policy making and program planning and to maximize limited resources [14]. In India, the disease pattern is complex, with an ongoing epidemiological transition and the coexistence of non-communicable and communicable diseases. There is limited evidence from low-resource settings on how to design simple, functional geovisualization applications for difficult policy making and program prioritization [15]. To address this gap, this paper aims to demonstrate the potential of the geovisualization technique and compare it with traditional GIS methods using data from the MATIND project in the state of Gujarat, India.
2. Overview of GIS Techniques
2.1. Technique for Bringing Health Data into GIS
Applying GIS methods requires accurate transformation of the location information in a dataset into geographic objects with latitude and longitude coordinates; this process is known as geocoding. Geocoding is demanding and tedious work, but it is the foundation of any visualization and repays the effort. Geocoded data can be processed in three major ways before visualization, as described here:
2.1.1. Point Patterns
Point patterns display data points such as public and private health facilities, mothers’ homes and village settlements. They are useful for defining areas of case occurrence, visually inspecting spatial clusters of patients or health facilities, and analyzing the distribution of health care resources. An example of point pattern analysis in maternal health is the geographic uptake of a government scheme such as “Chiranjeevi Yojana” (CY) in the MATIND data.
2.1.2. Line Patterns
Vectors or road lines are graphic resources that aid understanding of the proximity and accessibility of health care facilities to patients. Arrows with widths proportional to the volume of flow between areas are important tools for evaluating health care utilization across different locations.
2.1.3. Area Patterns
For spatial descriptive analysis, data for administrative areas such as villages, blocks and districts can be mapped, with variables such as population, number of households and number of CY users used to represent the data in a specific dimension.
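The aggregation behind an area pattern can be sketched in a few lines of Python. This is a minimal illustration, not the MATIND processing code: the records, the field names (`village`, `block`, `cy_user`) and the function `aggregate_by_area` are all hypothetical.

```python
from collections import defaultdict

# Hypothetical records: one row per mother, already geocoded to a village.
records = [
    {"village": "Village A", "block": "Block 1", "cy_user": True},
    {"village": "Village A", "block": "Block 1", "cy_user": False},
    {"village": "Village B", "block": "Block 1", "cy_user": True},
    {"village": "Village C", "block": "Block 2", "cy_user": True},
    {"village": "Village C", "block": "Block 2", "cy_user": True},
]

def aggregate_by_area(rows, level):
    """Count mothers and CY users for each area at the given administrative level."""
    totals = defaultdict(lambda: {"mothers": 0, "cy_users": 0})
    for row in rows:
        area = totals[row[level]]
        area["mothers"] += 1
        if row["cy_user"]:
            area["cy_users"] += 1
    # Add the CY share so that areas of different size can be compared.
    for area in totals.values():
        area["cy_share"] = area["cy_users"] / area["mothers"]
    return dict(totals)

village_summary = aggregate_by_area(records, "village")
block_summary = aggregate_by_area(records, "block")
```

The same individual-level records can be summarized at village or block level simply by changing the `level` argument, which is what makes the choice of administrative boundary a presentation decision rather than a data collection decision.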
2.2. Traditional GIS Visualization Technique
The maps most frequently used in public health research are dot-density and choropleth maps [16]. Dot-density maps are the simplest way to display events. These maps use dots or other symbols to represent the number of occurrences of a given data characteristic [17]. Each dot or symbol on the map represents a single entity (one dot = 1 case) or a group (one dot = 1000 cases). Dot-density maps are useful for area comparisons. However, they need to be interpreted with caution, paying attention to the “symbol to data characteristic” ratio. An example of a dot-density map for MATIND CY users is shown in “Figure 1”, where each dot represents a single mother or a health facility providing obstetric care.
Figure 1. Dot density map showing CY users.
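The caution about the “symbol to data characteristic” ratio can be made concrete with a small Python sketch. It assumes that dot counts are rounded to the nearest whole dot and that dots are scattered at random inside a rectangular bounding box; both are simplifying assumptions, and the function names and coordinates are hypothetical.

```python
import math
import random

def dot_count(cases, cases_per_dot):
    """Number of dots drawn for an area, rounded to the nearest whole dot."""
    return math.floor(cases / cases_per_dot + 0.5)

def scatter_dots(cases, cases_per_dot, bbox, seed=0):
    """Place dots at random inside a box given as (min_lon, min_lat, max_lon, max_lat)."""
    rng = random.Random(seed)  # fixed seed so the map is reproducible
    min_lon, min_lat, max_lon, max_lat = bbox
    return [
        (rng.uniform(min_lon, max_lon), rng.uniform(min_lat, max_lat))
        for _ in range(dot_count(cases, cases_per_dot))
    ]

# Hypothetical bounding box for one area.
area_box = (72.0, 22.0, 73.0, 23.0)
dots_grouped = scatter_dots(2400, 1000, area_box)  # one dot = 1000 cases
dots_single = scatter_dots(2400, 1, area_box)      # one dot = 1 case
```

Under these assumptions, an area with 2400 cases shows only 2 dots at a ratio of 1000 cases per dot, and an area with 400 cases shows none, which is why the ratio must be stated alongside the map.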
Choropleth maps are area maps in which polygons are shaded, colored, or patterned according to the value of a given attribute for each polygon. Choropleth maps are also called thematic maps or shaded maps. An example of a choropleth map for the concentration of CY users in a district is shown in “Figure 2”. The same information from Figure 1 is presented as shaded areas, where darker areas represent a higher concentration of CY users.
Figure 2. Choropleth map showing CY users.
It is important to choose the right features for map presentations, as the choice of color, pattern, size, polygon shape and class intervals can affect how one interprets the information presented in a map. Single-color maps with varying color intensity (shades) are often an effective means of presenting data, while differing patterns can help on a black-and-white or grey-scale map. Similar-size polygons are recommended to the extent possible, as a few large polygons can dominate a map, leading to misinterpretation of information. Proportions or rates can be displayed using different class interval schemes, such as equal intervals (equal ranges of values) or quintiles (an equal number of polygons falling into each of five classes). The latter technique is particularly useful for presenting skewed data, where the distribution of utilization is not known or is very uneven.
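The difference between the two class interval schemes can be illustrated with a short Python sketch. The village counts are hypothetical, the helper names are introduced only for this example, and the quantile rule used here (the value at the ceiling of each rank boundary) is one of several common conventions.

```python
from collections import Counter

def equal_interval_breaks(values, n_classes):
    """Upper class bounds that split the value range into equal-width classes."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_classes
    return [lo + width * i for i in range(1, n_classes + 1)]

def quantile_breaks(values, n_classes):
    """Upper class bounds chosen so each class holds about the same number of polygons."""
    ordered = sorted(values)
    n = len(ordered)
    # Position of the last value in class i, using integer ceiling division.
    return [ordered[(n * i + n_classes - 1) // n_classes - 1]
            for i in range(1, n_classes + 1)]

def classify(value, breaks):
    """Index of the first class whose upper bound is at least the value."""
    for index, upper in enumerate(breaks):
        if value <= upper:
            return index
    return len(breaks) - 1  # guards against floating-point rounding at the top bound

def class_counts(values, breaks):
    """Number of polygons that fall into each class."""
    counts = Counter(classify(v, breaks) for v in values)
    return [counts[i] for i in range(len(breaks))]

# Hypothetical, strongly skewed CY user counts for ten villages.
cy_users = [1, 2, 3, 4, 5, 6, 7, 8, 10, 100]

equal_counts = class_counts(cy_users, equal_interval_breaks(cy_users, 5))
quintile_counts = class_counts(cy_users, quantile_breaks(cy_users, 5))
```

With these values, equal intervals place nine of the ten villages in the lowest class and one in the highest, so almost the whole map would carry a single shade, whereas quintiles place two villages in each class.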
2.3. Web Based Geovisualization Technique
Geovisualization is the process of geospatial data analysis in which visualization is enabled by tools that combine information, cartographic and geographic methods [14]. It is used specifically to display geospatial data in order to explore, analyze and synthesize them, generate hypotheses, develop solutions and represent the data comprehensively. There are two philosophical approaches to geovisualization: the positivistic and the phenomenological. The positivistic approach uses spatial modeling to represent the real world as faithfully as possible. The phenomenological approach allows for individual interpretation of space and time and tends to represent space in the abstract [18].
The addition of novel geo-web tools in the Web 2.0 era, such as application programming interfaces (APIs) and open-source geospatial analysis packages, has further improved the utility of the positivistic approach [19]. Web 2.0 is distinguished by functions such as interaction, attribute filtering (including spatial and temporal attributes), dynamic and animated displays, near-real-time data updates and advanced analysis. Because geovisualization is so flexible, the only data for which it cannot be used are non-spatial data [14].
In the context of WebGIS, data representation is called “the science of virtual space”, or the way space is constructed, reconstructed, represented and finally interpreted through a specific method [20]. The degree of realism or abstraction in the representation of symbols as well as space must be directly connected to the intended purpose of the tool. Owing to the size and complexity of datasets, computer integration is necessary to facilitate geographic inquiry. When implementing computational methods, for example patterned aggregation of features by attribute, the integration must be optimized so that the user does not infer false relationships [20]. The geovisualization interface provides a bridge between the user and knowledge construction from the data, and is considered an external representation in that interacting with the tool alters the information that is displayed and the knowledge that is subsequently created [20]. Finally, different user groups will have different user experiences, so the target user group needs to be a central consideration in tool design. The choice of data representation, degree of computational integration and interface design can significantly alter the cognitive usability for different users and must be considered together for a comprehensive design [21]. To optimize its utility, the design of a geovisualization should be based on rigorous methods of data representation, visualization-computation integration, user interface design and matching of the tool to its intended purpose [22].
Geovisualization Technique Used on MATIND Data
A Python program was used to extract the MATIND data from the REDCap [23] database (Research Electronic Data Capture, a secure, web-based application designed to support data capture for research studies) and to convert the data into spatial information. The program used the REDCap API to extract the latest version of the database through a unique API URL and key. The dataset was then parsed using the simplekml Python module to create point KML features for each individual mother’s home location, defined as the centroid (geographic centre) of her home village. Separate KML layers, each indicating the outcome of a certain health variable for each mother (for example, CY status, ANC check-ups, type of delivery etc.), were also created to meet the research objectives. Additionally, aggregated village polygon KML layers were created for the same indicators to represent regional-level outcomes. The symbology of the layers (color gradient or symbol size/icon) was chosen based on the requirement. Once created, the spatial layers were automatically uploaded to a cloud-based server (Google Drive), where they could be accessed for the geovisualization as shown in “Figure 3”.
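The extraction and conversion steps described above can be sketched as follows. This is an illustration under stated assumptions, not the MATIND program: it builds the KML with Python's standard `xml.etree.ElementTree` module instead of the KML module named above so that the example has no third-party dependencies, and the field names (`record_id`, `village`, `cy_status`), the centroid coordinates and both function names are hypothetical. The export request uses the standard REDCap record export parameters (`token`, `content`, `format`, `type`).

```python
import json
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

KML_NS = "http://www.opengis.net/kml/2.2"

def export_records(api_url, api_token):
    """Download all records from a REDCap project as a list of dicts."""
    payload = urllib.parse.urlencode({
        "token": api_token,   # project-specific API key
        "content": "record",
        "format": "json",
        "type": "flat",
    }).encode()
    request = urllib.request.Request(api_url, data=payload)  # data makes this a POST
    with urllib.request.urlopen(request) as response:
        return json.load(response)

def build_kml(records, centroids, indicator):
    """Create a KML document with one point per mother at her village centroid."""
    ET.register_namespace("", KML_NS)
    kml = ET.Element(f"{{{KML_NS}}}kml")
    doc = ET.SubElement(kml, f"{{{KML_NS}}}Document")
    ET.SubElement(doc, f"{{{KML_NS}}}name").text = indicator
    for rec in records:
        lon, lat = centroids[rec["village"]]
        placemark = ET.SubElement(doc, f"{{{KML_NS}}}Placemark")
        ET.SubElement(placemark, f"{{{KML_NS}}}name").text = str(rec["record_id"])
        ET.SubElement(placemark, f"{{{KML_NS}}}description").text = (
            f"{indicator}: {rec[indicator]}"
        )
        point = ET.SubElement(placemark, f"{{{KML_NS}}}Point")
        # KML orders coordinates as longitude,latitude.
        ET.SubElement(point, f"{{{KML_NS}}}coordinates").text = f"{lon},{lat}"
    return ET.tostring(kml, encoding="unicode")

# Hypothetical sample data standing in for the output of export_records().
sample_records = [
    {"record_id": "1", "village": "Village A", "cy_status": "yes"},
    {"record_id": "2", "village": "Village B", "cy_status": "no"},
]
sample_centroids = {"Village A": (72.95, 22.55), "Village B": (73.1, 22.4)}

kml_text = build_kml(sample_records, sample_centroids, "cy_status")
```

In practice `export_records` would be called with the project's API URL and key, and `build_kml` would be run once per indicator to produce the separate layers; the resulting text can be written to a `.kml` file and uploaded to the shared storage.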
Figure 3. Process of development of web based geovisualization.
A key advantage of the tool is that large datasets are hosted on the cloud, where they can be accessed, edited and updated remotely and no longer need to reside on an individual hard drive or server. Subsequently, the data can be created and/or managed, as well as accessed and utilized, by a group of actors including governments, citizens, civil society, businesses and academia on a variety of platforms without needing special software. This is largely due to the convergence of geospatial data standards (led by the Open Geospatial Consortium) promoting interoperability between and within traditional GIS methods and Web-GIS applications such as the geoweb [21]. Users can create online maps on services such as the Google Maps or OpenStreetMap platforms with very basic knowledge of any one of a multitude of computer languages. Furthermore, cloud and non-cloud based datasets can be dynamically linked to these services through a variety of methods and spatial data formats.
“Figure 4” shows a template of web based geovisualization. From this step on, a non-GIS expert can access and use the spatial data as per their need without any requirement for GIS software or extensive training.
Here, the geovisualization is an HTML file (webpage) that uses the Google Maps API platform to display spatial data. A drop-down menu of individual and aggregate level spatial indicators, displayed in the bottom right corner of the webpage, allows the user to select the indicator/layer of their choice. As users explore the data, they can view all the attributes of each point or polygon by clicking it and reading its attributes in a side window. To facilitate usability and reduce the cognitive load caused by clutter, the user can zoom and pan throughout the map, as well as change the style of the map (satellite, road map, custom, etc.).
Both conventional GIS methods and geovisualization have their advantages and limitations. The decision about which technique to use depends on the purpose. For example, conventional GIS techniques are better for in-depth spatial data analysis, while geovisualization offers superior spatial data display.
3. Comparison of Conventional GIS Technique with Geovisualization
Currently, the scope of geovisualization is widening from exclusive use by GIS experts, innovators and early adopters towards a broader audience of non-GIS users including epidemiologists, policy makers and program planners. The MATIND Gujarat data show that geovisualization allows a number of variables to be displayed in a single view and gives the user an improved understanding of the complex relationships between these variables. For example, it helps the user understand the dynamic relationship between CY use and its predictors in the study areas, such as the proportion of CY-eligible women, the number of CY providers and their spatial distribution (as seen in Figure 3). This in turn can be useful for identifying gaps and ways to improve implementation of the program in a local context. “Table 1” compares newer geovisualization with traditional GIS software from different aspects.
Future of Geovisualization in Implementation Research in Health
As Slocum et al. suggest, “The most sophisticated technology will be of little use if people cannot utilize it effectively” [17]. In the same vein, widespread use of geovisualization by public health researchers will make the complex and labor-intensive process of developing the tool worthwhile. The approach could be extended to different areas of public health to evaluate unmet needs by geographic area, which is important for realistic evaluation and for prioritizing resource allocation in low-resource environments such as India to achieve optimal use and the maximum possible coverage. In short, geovisualization puts GIS technology to work for program managers and planners without requiring them to learn advanced GIS techniques. Apart from its ease of use, it is a cheaper option than software-based GIS technology and, once created, can be used for a long time by multiple stakeholders.
Table 1. Comparison of traditional software based GIS techniques with geovisualization.
Figure 4. Web based Geovisualization snapshot of spatial distribution of CY users.
4. Conclusion
Public health experts use GIS techniques sparingly because the technology is difficult to learn and the software is costly. Geovisualization provides a user-friendly tool for presenting large-scale community-based survey data or routinely collected HMIS data without losing their complexity. Dynamic, interactive and temporal geovisualization makes it possible for non-GIS experts to understand and disseminate public health data, which are inherently spatial in nature and cannot be presented as easily through paper-based GIS maps or quantitative analysis alone.
Author Contribution
Conceptualization: SY, KV, DVM. Analysis: SY, CH, AU. Writing: SY, KV, CH. Review and critical comments: KV, DVM.
Source of Funding
The research leading to these results has received funding from the European Community’s Seventh Framework Programme under grant agreement No. [261304].