1
|
Ullah A, Shabbir J, Alomair AM, Alarfaj FK. Compromise optimum allocation in neutrosophic multi-character survey under stratified random sampling using neutrosophic fuzzy programming. Heliyon 2024; 10:e28327. [PMID: 38571640 PMCID: PMC10988009 DOI: 10.1016/j.heliyon.2024.e28327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Revised: 02/24/2024] [Accepted: 03/15/2024] [Indexed: 04/05/2024] Open
Abstract
Survey sampling has wide range of applications in social and scientific investigation to draw inference about the unknown parameter of interest. In complex surveys, the sample information about the study variable cannot be expressed by a precise number under uncertain environment due fuzziness and indeterminacy. Therefore, this information is expressed by neutrosophic numbers rather than the classical numbers. The neutrosophic statistics, which is generalization of classical statistics, deals with the neutrosophic data that has some degree of indeterminacy and fuzziness. In this study, we investigate the compromise optimum allocation problem for estimating the population means of the neutrosophic study variables in a multi-character stratified random sampling under uncertain per unit measurement cost. We proposed the intuitionistic fuzzy cost function, modeling the fuzzy uncertainty in stratum per unit measurement cost. The compromise optimum allocation problem is formulated as a multi-objective intuitionistic fuzzy optimization problem. The solution methodology is suggested using neutrosophic fuzzy programming and intuitionistic fuzzy programming approaches. A numerical study includes the means estimation of atmospheric variables is presented to explore the real-life application, explain the mathematical formulation, and efficiency comparison with some existing methods. The results show that the suggested methods produce more precise estimates with less utilization of survey resources as compared to some existing methods. The Python is used for statistical analysis, graphical designing and numerical optimization problems are solved using GAMS.
Collapse
Affiliation(s)
- Atta Ullah
- Department of Statistics, Quaid-i-Azam University, Islamabad 45320, Pakistan
- Department of Mathematics, COMSATS University Islamabad, Attock Campus 43600, Pakistan
| | - Javid Shabbir
- Department of Statistics, Quaid-i-Azam University, Islamabad 45320, Pakistan
- Department of Statistics, University of Wah, Wah Cant 47040, Pakistan
| | - Abdullah Mohammed Alomair
- Department of Quantitative Methods, School of Business, King Faisal University, Al-Ahsa 31982, Saudi Arabia
| | - Fawaz Khaled Alarfaj
- Department of Management Information Systems (MIS), School of Business, King Faisal University, Al-Ahsa 31982, Saudi Arabia
| |
Collapse
|
2
|
Tenorio FA, Rattalino Edreira JI, Monzon JP, Aramburu-Merlos F, Dobermann A, Gruere A, Brihet JM, Gayo S, Conley S, Mourtzinis S, Mashingaidze N, Sananka A, Aston S, Ojeda JJ, Grassini P. Filling the agronomic data gap through a minimum data collection approach. Field Crops Res 2024; 308:109278. [PMID: 38495465 PMCID: PMC10933791 DOI: 10.1016/j.fcr.2024.109278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 01/12/2024] [Accepted: 01/24/2024] [Indexed: 03/19/2024]
Abstract
Context Agronomic data such as applied inputs, management practices, and crop yields are needed for assessing productivity, nutrient balances, resource use efficiency, as well as other aspects of environmental and economic performance of cropping systems. In many instances, however, these data are only available at a coarse level of aggregation or simply do not exist. Objectives Here we developed an approach that identifies sites for agronomic data collection for a given crop and country, seeking a balance between minimizing data collection efforts and proper representation of the main crop producing areas. Methods The developed approach followed a stratified sampling method based on a spatial framework that delineates major climate zones and crop area distribution maps, which guides selection of sampling areas (SA) until half of the national harvested area is covered. We provided proof of concept about the robustness of the approach using three rich databases including data on fertilizer application rates for maize, wheat, and soybean in Argentina, soybean in the USA, and maize in Kenya, which were collected via local experts (Argentina) and field surveys (USA and Kenya). For validation purposes, fertilizer rates per crop and nutrient derived at (sub-) national level following our approach were compared against those derived using all data collected from the whole country. Results Application of the approach in Argentina, USA, and Kenya resulted in selection of 12, 28, and 10 SAs, respectively. For each SA, three experts or 20 fields were sufficient to give a robust estimate of average fertilizer rates applied by farmers. Average rates at national level derived from our approach compared well with those derived using the whole database ( ± 10 kg N, ± 2 kg P, ± 1 kg S, and ± 5 kg K per ha) requiring less than one third of the observations. Conclusions The developed minimum crop data collection approach can fill the agronomic data gaps in a cost-effective way for major crop systems both in large- and small-scale systems. Significance The proposed approach is generic enough to be applied to any crop-country combination to guide collection of key agricultural data at national and subnational levels with modest investment especially for countries that do not currently collect data.
Collapse
Affiliation(s)
- Fatima A.M. Tenorio
- Department of Agronomy and Horticulture, University of Nebraska–Lincoln, Lincoln, NE 68583-0915, USA
| | - Juan I. Rattalino Edreira
- Department of Agronomy and Horticulture, University of Nebraska–Lincoln, Lincoln, NE 68583-0915, USA
| | - Juan Pablo Monzon
- Department of Agronomy and Horticulture, University of Nebraska–Lincoln, Lincoln, NE 68583-0915, USA
| | - Fernando Aramburu-Merlos
- Department of Agronomy and Horticulture, University of Nebraska–Lincoln, Lincoln, NE 68583-0915, USA
| | - Achim Dobermann
- International Fertilizer Association, 49 Avenue d′lena, 75116 Paris, France
| | - Armelle Gruere
- International Fertilizer Association, 49 Avenue d′lena, 75116 Paris, France
| | - Juan Martin Brihet
- Department of Technological Prospective and Research, Buenos Aires Grain Exchange, Buenos Aires, Argentina
| | - Sofia Gayo
- Department of Technological Prospective and Research, Buenos Aires Grain Exchange, Buenos Aires, Argentina
| | - Shawn Conley
- Department of Agronomy, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Spyridon Mourtzinis
- Department of Agronomy, University of Wisconsin-Madison, Madison, WI 53706, USA
| | | | | | | | | | - Patricio Grassini
- Department of Agronomy and Horticulture, University of Nebraska–Lincoln, Lincoln, NE 68583-0915, USA
| |
Collapse
|
3
|
Tang IW, Bartell SM, Vieira VM. Unmatched spatially stratified controls: A simulation study examining efficiency and precision using spatially-diverse controls and generalized additive models. Spat Spatiotemporal Epidemiol 2023; 45:100584. [PMID: 37301599 DOI: 10.1016/j.sste.2023.100584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 03/15/2023] [Accepted: 04/07/2023] [Indexed: 06/12/2023]
Abstract
Unmatched spatially stratified random sampling (SSRS) of non-cases selects geographically balanced controls by dividing the study area into spatial strata and randomly selecting controls from all non-cases within each stratum. The performance of SSRS control selection was evaluated in a case study spatial analysis of preterm birth in Massachusetts. In a simulation study, we fit generalized additive models using controls selected by SSRS or simple random sample (SRS) designs. We compared mean squared error (MSE), bias, relative efficiency (RE), and statistically significant map results to the model results with all non-cases. SSRS designs had lower average MSE (0.0042-0.0044) and higher RE (77-80%) compared to SRS designs (MSE: 0.0072-0.0073; RE across designs: 71%). SSRS map results were more consistent across simulations, reliably identifying statistically significant areas. SSRS designs improved efficiency by selecting controls that are geographically distributed, particularly from low population density areas, and may be more appropriate for spatial analyses.
Collapse
Affiliation(s)
- Ian W Tang
- Department of Environmental and Occupational Health, Program in Public Health, Susan and Henry Samueli College of Health Sciences, University of California, 100 Theory Drive, Suite 100, Irvine, CA 92617, USA.
| | - Scott M Bartell
- Department of Environmental and Occupational Health, Program in Public Health, Susan and Henry Samueli College of Health Sciences, University of California, 100 Theory Drive, Suite 100, Irvine, CA 92617, USA; Department of Statistics, Donald Bren School of Information & Computer Sciences, University of California, Irvine, USA
| | - Verónica M Vieira
- Department of Environmental and Occupational Health, Program in Public Health, Susan and Henry Samueli College of Health Sciences, University of California, 100 Theory Drive, Suite 100, Irvine, CA 92617, USA
| |
Collapse
|
4
|
Abstract
OBJECTIVES Although comorbidity increases the health care and community support needs for patients, and the burden for the health care system, there are few population-based studies on comorbidity in patients with stroke. This study aims to evaluate the occurrence of important comorbidities among stroke patients in the Canadian population. METHODS Data from the population-based 2011-2012 Canadian Community Health Survey containing responses from 124,929 participants covering about 98% of the Canadian population when weighted were examined and analyzed by means of logistic regression models. RESULTS There was a statistically significant association between stroke history and multiple comorbid risk factors. Stroke prevalence increased in individuals with heart disease (odds ratio (OR): 3.80, 95% confidence interval (CI): 3.77-3.84), hypertension (OR: 1.97, 95% CI: 1.95-1.99), diabetes (OR: 1.74, 95% CI: 1.72-1.75), mood disorder (OR: 2.14, 95% CI: 2.12-2.17), and chronic obstructive pulmonary disease (COPD) (OR: 1.46, 95% CI: 1.44-1.48) compared to others without the condition. Of 2067 participants with stroke, 1680 (81.3%) had one or more comorbid conditions (heart disease, hypertension, diabetes, mood disorder, or COPD) that coexist with stroke and 48% had two or more. Comorbidity increased with age, and two-thirds of stroke patients with comorbid medical conditions were 60 years of age or older. CONCLUSION This population-based study provides evidence of comorbidity between stroke and other conditions that include heart disease, hypertension, diabetes, mood disorder, and COPD. Canadian individuals with stroke have a high burden of comorbidity. Health care systems need to recognize and respond to the strong association of comorbidity and stroke occurrence. This key factor should be considered when allocating resources.
Collapse
|
5
|
Siegel JW, Xu J. Approximation rates for neural networks with general activation functions. Neural Netw 2020; 128:313-321. [PMID: 32470796 DOI: 10.1016/j.neunet.2020.05.019] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2019] [Revised: 05/08/2020] [Accepted: 05/14/2020] [Indexed: 10/24/2022]
Abstract
We prove some new results concerning the approximation rate of neural networks with general activation functions. Our first result concerns the rate of approximation of a two layer neural network with a polynomially-decaying non-sigmoidal activation function. We extend the dimension independent approximation rates previously obtained to this new class of activation functions. Our second result gives a weaker, but still dimension independent, approximation rate for a larger class of activation functions, removing the polynomial decay assumption. This result applies to any bounded, integrable activation function. Finally, we show that a stratified sampling approach can be used to improve the approximation rate for polynomially decaying activation functions under mild additional assumptions.
Collapse
Affiliation(s)
- Jonathan W Siegel
- Department of Mathematics, Pennsylvania State University, University Park, PA 16802, USA.
| | - Jinchao Xu
- Department of Mathematics, Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|
6
|
Reina-García J, Toro-Vélez AF, Peña-Varón MR, Olaya-Ochoa J, Figueroa-Casas A. Methodological design for the macro-location of a micropollutants monitoring network in tropical rivers: a case study in Cauca River. Environ Monit Assess 2020; 192:205. [PMID: 32124068 DOI: 10.1007/s10661-020-8154-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Accepted: 02/11/2020] [Indexed: 06/10/2023]
Abstract
Establishing scientifically the macro-location of a micropollutants monitoring network in tropical Andean rivers is a complex process, because information gathering is restricted by high-cost of analysis and limited availability of analytical techniques, which lead to inadequate sampling strategies that hinder the representativeness of samples. Thus, this work proposes a methodology for determining the number of representative sampling sections in a micropollutant monitoring network to characterise the ecological risk in tropical Andean torrential rivers. The proposed methodology consists of four stages: identification of the potential sampling units by Spline interpolation; calculation of the number of representative sections for a stratified sampling with an acceptable level of confidence and error; spatial allocation of the potential sampling units into sections by hierarchical cluster analysis; and representation of the spatial distribution of the sampling sections through a geographic information system (GIS). The proposed methodology is dynamic, and therefore, it can be revisited as more data are obtained in the subsequent years; it has the possibility of being applied to other inter-Andean valley rivers that interact with the tropical Andean sloppy mountains and serves as a tool for decision making by environmental authorities regarding the optimisation of the existing monitoring networks in terms of micropollutants to promote sustainable management of water resources. The proposed methodology is applied in the Upper Cauca River Basin (UCRB), which is located in southwest Colombia, South America.
Collapse
Affiliation(s)
- Jhovana Reina-García
- Cinara Institute, Faculty of Engineering, University of Valle, Cali, 76001, Colombia.
| | - Andrés F Toro-Vélez
- Doctorate in Environmental Sciences, University of Cauca, Popayán, 190003, Colombia
| | - Miguel R Peña-Varón
- Cinara Institute, Faculty of Engineering, University of Valle, Cali, 76001, Colombia
| | - Javier Olaya-Ochoa
- School of Statistics, Faculty of Engineering, University of Valle, Cali, 76001, Colombia
| | | |
Collapse
|
7
|
Du Z, Chen Q, Lyu X, Wang T, Wang C. Analyzing the distribution of rabies clinics and achievements of standardized rabies clinics implementation in mainland China. BMC Health Serv Res 2019; 19:955. [PMID: 31829178 PMCID: PMC6907271 DOI: 10.1186/s12913-019-4730-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Accepted: 11/08/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND For rabies prevention and treatment, the Chinese government has been establishing standardized rabies clinics since 2016. This study aimed to investigate the distribution of rabies clinics and the achievements of newly-implemented standardized rabies clinics in mainland China, for the purpose of providing further rabies control strategies. METHODS The number of rabies clinics, including per million inhabitants in each region, was determined. We sampled 1200 clinics from 8 provinces by multi-stage stratified sampling, and a questionnaire survey was carried out to record each clinic's achievements. Data collected from 1185 questionnaires were analyzed. RESULTS We found that rabies clinics were mostly located in the southwest, central, and eastern regions of China; these accounted for 67.1% of all clinics. The eastern and south regions showed the lowest number of rabies clinics per million inhabitants (0.15 and 0.12, respectively). The total standard-reaching rate of rabies clinics in mainland China was only 11.0%, with significant differences in the rate among regions (X2 = 33.004, p < 0.001). Specifically, the qualified rates of supporting facilities and functional areas were 13.9% (X2 = 34.003, p < 0.001) and 56.1% (X2 = 9.943, p = 0.019), respectively. Vaccines with 2 different substrates and professional flushing equipment were provided by 40.5% (X2 = 27.935, p = 0.001) and 37.7% (X2 = 54.922, p = 0.001) of clinics, respectively. CONCLUSION Regional differences do exist in the distribution of rabies clinics in mainland China, with relative low number per million population in south and eastern China. There are few standardized rabies clinics in mainland China. Efforts are needed to establish supporting facilities, especially for wound treatment and vaccination. Future research should focus on the improvement of rabies clinics standardization.
Collapse
Affiliation(s)
- Zhe Du
- Trauma Center, Peking University People's Hospital, No.11 South Xizhimen Street, Beijing, 100044, China
| | - Qingjun Chen
- Department of Emergency, Beijing Hepingli Hospital, Beijing, China
| | - Xinjun Lyu
- China Center for Disease Control and Prevention, Institute for Viral Disease Prevention and Control, Beijing, China
| | - Tianbing Wang
- Trauma Center, Peking University People's Hospital, No.11 South Xizhimen Street, Beijing, 100044, China.
| | - Chuanlin Wang
- Trauma Center, Peking University People's Hospital, No.11 South Xizhimen Street, Beijing, 100044, China. .,Department of Emergency, Peking University People's Hospital, Beijing, China.
| |
Collapse
|
8
|
Kim S, Zeng D, Cai J. Analysis of multiple survival events in generalized case-cohort designs. Biometrics 2018; 74:1250-1260. [PMID: 29992545 DOI: 10.1111/biom.12923] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Revised: 05/01/2018] [Accepted: 05/01/2018] [Indexed: 01/04/2023]
Abstract
Generalized case-cohort design has been proposed to assess the effects of exposures on survival outcomes when measuring exposures is expensive and events are not rare in the cohort. In such design, expensive exposure information is collected from both a (stratified) randomly selected subcohort and a subset of individuals with events. In this article, we consider extension of such design to study multiple types of survival events by selecting a proportion of cases for each type of event. We propose a general weighting scheme to analyze data. Furthermore, we examine the optimal choice of weights and show that this optimal weighting yields much improved efficiency gain both asymptotically and in simulation studies. Finally, we apply our proposed methods to data from the Atherosclerosis Risk in Communities study.
Collapse
Affiliation(s)
- Soyoung Kim
- Division of Biostatistics, Medical College of Wisconsin, Milwaukee, Wisconsin, U.S.A
| | - Donglin Zeng
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, U.S.A
| | - Jianwen Cai
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, U.S.A
| |
Collapse
|
9
|
Liu J, Wang G. Pharmacovigilance from social media: An improved random subspace method for identifying adverse drug events. Int J Med Inform 2018; 117:33-43. [PMID: 30032963 DOI: 10.1016/j.ijmedinf.2018.06.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Revised: 05/10/2018] [Accepted: 06/12/2018] [Indexed: 11/17/2022]
Abstract
OBJECTIVE Recent advances in Web 2.0 technologies have seen significant strides towards utilizing patient-generated content for pharmacovigilance. Social media-based pharmacovigilance has great potential to augment current efforts and provide regulatory authorities with valuable decision aids. Among various pharmacovigilance activities, identifying adverse drug events (ADEs) is very important for patient safety. However, in health-related discussion forums, ADEs may confound with drug indications and beneficial effects, etc. Therefore, the focus of this study is to develop a strategy to identify ADEs from other semantic types, and meanwhile to determine the drug that an ADE is associated with. MATERIALS AND METHODS In this study, two groups of features, i.e., shallow linguistic features and semantic features, are explored. Moreover, motivated and inspired by the characteristics of explored two feature categories for social media-based ADE identification, an improved random subspace method, called Stratified Sampling-based Random Subspace (SSRS), is proposed. Unlike conventional random subspace method that applies random sampling for subspace selection, SSRS adopts stratified sampling-based subspace selection strategy. RESULTS A case study on heart disease discussion forums is performed to evaluate the effectiveness of the SSRS method. Experimental results reveal that the proposed SSRS method significantly outperforms other compared ensemble methods and existing approaches for ADE identification. DISCUSSION AND CONCLUSION Our proposed method is easy to implement since it is based on two feature sets that can be naturally derived, and therefore, can omit artificial stratum generation efforts. Moreover, SSRS has great potential of being applied to deal with other high-dimensional problems that can represent original data from two different aspects.
Collapse
Affiliation(s)
- Jing Liu
- School of Management Science and Engineering, Tianjin University of Finance and Economics, Tianjin 300222, PR China
| | - Gang Wang
- School of Management, Hefei University of Technology, Hefei, Anhui 230009, PR China.
| |
Collapse
|
10
|
Diaz-Quijano FA. Sample allocation balancing overall representativeness and stratum precision. Ann Epidemiol 2018; 28:570-575. [PMID: 29784525 DOI: 10.1016/j.annepidem.2018.04.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2017] [Revised: 03/20/2018] [Accepted: 04/27/2018] [Indexed: 11/28/2022]
Abstract
PURPOSE In large-scale surveys, it is often necessary to distribute a preset sample size among a number of strata. Researchers must make a decision between prioritizing overall representativeness or precision of stratum estimates. Hence, I evaluated different sample allocation strategies based on stratum size. METHODS The strategies evaluated herein included allocation proportional to stratum population; equal sample for all strata; and proportional to the natural logarithm, cubic root, and square root of the stratum population. This study considered the fact that, from a preset sample size, the dispersion index of stratum sampling fractions is correlated with the population estimator error and the dispersion index of stratum-specific sampling errors would measure the inequality in precision distribution. Identification of a balanced and efficient strategy was based on comparing those both dispersion indices. RESULTS Balance and efficiency of the strategies changed depending on overall sample size. As the sample to be distributed increased, the most efficient allocation strategies were equal sample for each stratum; proportional to the logarithm, to the cubic root, to square root; and that proportional to the stratum population, respectively. CONCLUSIONS Depending on sample size, each of the strategies evaluated could be considered in optimizing the sample to keep both overall representativeness and stratum-specific precision.
Collapse
|
11
|
Lafontaine SJV, Sawada M, Kristjansson E. A direct observation method for auditing large urban centers using stratified sampling, mobile GIS technology and virtual environments. Int J Health Geogr 2017; 16:6. [PMID: 28209210 PMCID: PMC5314488 DOI: 10.1186/s12942-017-0079-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2016] [Accepted: 02/04/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND With the expansion and growth of research on neighbourhood characteristics, there is an increased need for direct observational field audits. Herein, we introduce a novel direct observational audit method and systematic social observation instrument (SSOI) for efficiently assessing neighbourhood aesthetics over large urban areas. METHODS Our audit method uses spatial random sampling stratified by residential zoning and incorporates both mobile geographic information systems technology and virtual environments. The reliability of our method was tested in two ways: first, in 15 Ottawa neighbourhoods, we compared results at audited locations over two subsequent years, and second; we audited every residential block (167 blocks) in one neighbourhood and compared the distribution of SSOI aesthetics index scores with results from the randomly audited locations. Finally, we present interrater reliability and consistency results on all observed items. RESULTS The observed neighbourhood average aesthetics index score estimated from four or five stratified random audit locations is sufficient to characterize the average neighbourhood aesthetics. The SSOI was internally consistent and demonstrated good to excellent interrater reliability. At the neighbourhood level, aesthetics is positively related to SES and physical activity and negatively correlated with BMI. CONCLUSION The proposed approach to direct neighbourhood auditing performs sufficiently and has the advantage of financial and temporal efficiency when auditing a large city.
Collapse
Affiliation(s)
| | - M Sawada
- Laboratory for Applied Geomatics and GIS Science (LAGGISS), Department of Geography, Environment and Geomatics, University of Ottawa, Ottawa, ON, K1N 6N5, Canada.
- Ottawa Neighbourhood Study (ONS), University of Ottawa, Vanier 5023, 136 Jean Jacques Lussier, Ottawa, ON, K1N 6N5, Canada.
| | - Elizabeth Kristjansson
- Ottawa Neighbourhood Study (ONS), University of Ottawa, Vanier 5023, 136 Jean Jacques Lussier, Ottawa, ON, K1N 6N5, Canada
- School of Psychology and Institute of Population Health, University of Ottawa, Ottawa, ON, K2L 1K9, Canada
| |
Collapse
|
12
|
Wickham J, Stehman SV, Gass L, Dewitz JA, Sorenson DG, Granneman BJ, Poss RV, Baer LA. Thematic accuracy assessment of the 2011 National Land Cover Database (NLCD). Remote Sens Environ 2017; 191:328-341. [PMID: 31346298 PMCID: PMC6657805 DOI: 10.1016/j.rse.2016.12.026] [Citation(s) in RCA: 71] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Accuracy assessment is a standard protocol of National Land Cover Database (NLCD) mapping. Here we report agreement statistics between map and reference labels for NLCD 2011, which includes land cover for ca. 2001, ca. 2006, and ca. 2011. The two main objectives were assessment of agreement between map and reference labels for the three, single-date NLCD land cover products at Level II and Level I of the classification hierarchy, and agreement for 17 land cover change reporting themes based on Level I classes (e.g., forest loss; forest gain; forest, no change) for three change periods (2001-2006, 2006-2011, and 2001-2011). The single-date overall accuracies were 82%, 83%, and 83% at Level II and 88%, 89%, and 89% at Level I for 2011, 2006, and 2001, respectively. Many class-specific user's accuracies met or exceeded a previously established nominal accuracy benchmark of 85%. Overall accuracies for 2006 and 2001 land cover components of NLCD 2011 were approximately 4% higher (at Level II and Level I) than the overall accuracies for the same components of NLCD 2006. The high Level I overall, user's, and producer's accuracies for the single-date eras in NLCD 2011 did not translate into high class-specific user's and producer's accuracies for many of the 17 change reporting themes. User's accuracies were high for the no change reporting themes, commonly exceeding 85%, but were typically much lower for the reporting themes that represented change. Only forest loss, forest gain, and urban gain had user's accuracies that exceeded 70%. Lower user's accuracies for the other change reporting themes may be attributable to the difficulty in determining the context of grass (e.g., open urban, grassland, agriculture) and between the components of the forest-shrubland-grassland gradient at either the mapping phase, reference label assignment phase, or both. NLCD 2011 user's accuracies for forest loss, forest gain, and urban gain compare favorably with results from other land cover change accuracy assessments.
Collapse
Affiliation(s)
- James Wickham
- Systems Exposure Division, National Exposure Research Laboratory, U.S. Environmental Protection Agency, Research Triangle Park, NC 27111, USA
| | - Stephen V Stehman
- College of Environmental Science and Forestry, State University of New York, Syracuse, NY 13210, USA
| | - Leila Gass
- U.S. Geological Survey, 520 N. Park Ave., Tucson, AZ 85179, USA
| | - Jon A Dewitz
- Earth Resources and Observation Science (EROS) Center, U.S. Geological Survey, 47914 252nd St., Sioux Falls, SD 57198, USA
| | | | - Brian J Granneman
- SGT, Inc., Contractor to the USGS Earth Resources and Observation Science (EROS) Center, U.S. Geological Survey, 47914 252nd St., Sioux Falls, SD 57198, USA
| | - Richard V Poss
- U.S. Geological Survey, Box 25406, Denver Federal Center, MS 980, Denver, CO 80225, USA
| | - Lori A Baer
- U.S. Geological Survey, Box 25406, Denver Federal Center, MS 980, Denver, CO 80225, USA
| |
Collapse
|
13
|
Abstract
In population studies, it is standard to sample data via designs in which the population is divided into strata, with the different strata assigned different probabilities of inclusion. Although there have been some proposals for including sample survey weights into Bayesian analyses, existing methods require complex models or ignore the stratified design underlying the survey weights. We propose a simple approach based on modeling the distribution of the selected sample as a mixture, with the mixture weights appropriately adjusted, while accounting for uncertainty in the adjustment. We focus for simplicity on Dirichlet process mixtures but the proposed approach can be applied more broadly. We sketch a simple Markov chain Monte Carlo algorithm for computation, and assess the approach via simulations and an application.
Collapse
Affiliation(s)
- T Kunihama
- Department of Statistics, University of Washington, Seattle, WA 98195, U.S.A
| | - A H Herring
- Department of Biostatistics and Carolina Population Center, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, U.S.A
| | - C T Halpern
- Department of Maternal and Child Health and Carolina Population Center, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, U.S.A
| | - D B Dunson
- Department of Statistical Science, Duke University, Durham, NC 27708, U.S.A
| |
Collapse
|
14
|
Bai X, Tsiatis AA. A log rank type test in observational survival studies with stratified sampling. Lifetime Data Anal 2016; 22:280-298. [PMID: 26025499 PMCID: PMC4664585 DOI: 10.1007/s10985-015-9331-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2014] [Accepted: 05/20/2015] [Indexed: 06/04/2023]
Abstract
In randomized clinical trials, the log rank test is often used to test the null hypothesis of the equality of treatment-specific survival distributions. In observational studies, however, the ordinary log rank test is no longer guaranteed to be valid. In such studies we must be cautious about potential confounders; that is, the covariates that affect both the treatment assignment and the survival distribution. In this paper, two cases were considered: the first is when it is believed that all the potential confounders are captured in the primary database, and the second case where a substudy is conducted to capture additional confounding covariates. We generalize the augmented inverse probability weighted complete case estimators for treatment-specific survival distribution proposed in Bai et al. (Biometrics 69:830-839, 2013) and develop the log rank type test in both cases. The consistency and double robustness of the proposed test statistics are shown in simulation studies. These statistics are then applied to the data from the observational study that motivated this research.
Collapse
|
15
|
Eide HK, Šaltytė Benth J, Sortland K, Halvorsen K, Almendingen K. Prevalence of nutritional risk in the non-demented hospitalised elderly: a cross-sectional study from Norway using stratified sampling. J Nutr Sci 2015; 4:e18. [PMID: 26097701 DOI: 10.1017/jns.2015.8] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2014] [Revised: 08/31/2014] [Accepted: 01/21/2015] [Indexed: 12/27/2022] Open
Abstract
There is a lack of accurate prevalence data on undernutrition and the risk of
undernutrition among the hospitalised elderly in Europe and Norway. We aimed at estimating
the prevalence of nutritional risk by using stratified sampling along with adequate power
calculations. A cross-sectional study was carried out in the period 2011 to 2013 at a
university hospital in Norway. Second-year nursing students in acute care clinical studies
in twenty hospital wards screened non-demented elderly patients for nutritional risk, by
employing the Nutritional Risk Screening 2002 (NRS2002) form. In total, 508 patients (48·8
% women and 51·2 % men) with a mean age of 79·6 (sd 6·4) years were screened by
the students. Mean BMI was 24·9 (sd 4·9) kg/m2, and the patients had
been hospitalised for on average 5·3 (sd 6·3) d. WHO's BMI cut-off values
identified 6·5 % as underweight, 48·0 % of normal weight and 45·5 % as overweight.
Patients nutritionally at risk had been in hospital longer and had lower average weight
and BMI compared with those not at risk (all P < 0·001); no
differences in mean age or sex were observed. The prevalence of nutritional risk was
estimated to be 45·4 (95 % CI 41·7 %, 49·0) %, ranging between 20·0 and 65·0 % on
different hospital wards. The present results show that the prevalence of nutritional risk
among elderly patients without dementia is high, suggesting that a large proportion of the
hospitalised elderly are in need of nutritional treatment.
Collapse
|
16
|
Costa EG, Lopes RM, Singer JM. Implications of heterogeneous distributions of organisms on ballast water sampling. Mar Pollut Bull 2015; 91:280-287. [PMID: 25510550 DOI: 10.1016/j.marpolbul.2014.11.030] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2014] [Revised: 11/17/2014] [Accepted: 11/20/2014] [Indexed: 06/04/2023]
Abstract
Ballast water sampling is one of the problems still needing investigation in order to enforce the D-2 Regulation of the International Convention for the Control and Management of Ship Ballast Water and Sediments. Although statistical "representativeness" of the sample is an issue usually discussed in the literature, neither a definition nor a clear description of its implications are presented. In this context, we relate it to the heterogeneity of the distribution of organisms in ballast water and show how to specify compliance tests under different models based on the Poisson and negative binomial distributions. We provide algorithms to obtain minimum sample volumes required to satisfy fixed limits on the probabilities of Type I and II errors. We show that when the sample consists of a large number of aliquots, the Poisson model may be employed even under moderate heterogeneity of the distribution of the organisms in the ballast water tank.
Collapse
Affiliation(s)
- Eliardo G Costa
- Departamento de Estatística, Instituto de Matemática e Estatística, Universidade de São Paulo, São Paulo, Brazil.
| | - Rubens M Lopes
- Instituto Oceanográfico, Universidade de São Paulo, São Paulo, Brazil.
| | - Julio M Singer
- Departamento de Estatística, Instituto de Matemática e Estatística, Universidade de São Paulo, São Paulo, Brazil.
| |
Collapse
|
17
|
Kim L, Kim JA, Kim S. A guide for the utilization of Health Insurance Review and Assessment Service National Patient Samples. Epidemiol Health 2014; 36:e2014008. [PMID: 25078381 PMCID: PMC4151963 DOI: 10.4178/epih/e2014008] [Citation(s) in RCA: 389] [Impact Index Per Article: 38.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2014] [Accepted: 07/29/2014] [Indexed: 11/09/2022] Open
Abstract
The claims data of the Health Insurance Review and Assessment Service (HIRA) is an important source of information for healthcare service research. The claims data of HIRA is collected when healthcare service providers submit a claim to HIRA to be reimbursed for a service that they provided to patients. To improve the accessibility of healthcare service researchers to claims data of HIRA, HIRA has developed the Patient Samples which are extracted using a stratified randomized sampling method. The Patient Samples of HIRA consist of five tables: a table for general information (Table 20) containing socio-demographic information such as gender, age and medical aid, indicators for inpatient and outpatient services; a table for specific information on healthcare services provided (Table 30); a table for diagnostic information (Table 40); a table for outpatient prescriptions (Table 53) and a table for information on healthcare service providers (Table of providers). Researchers who are interested in using the Patient Sample data for research can apply via HIRA's website (https://www.hira.or.kr).
Collapse
Affiliation(s)
- Logyoung Kim
- Health Insurance Review and Assessment Service, Seoul, Korea
| | - Jee-Ae Kim
- Health Insurance Review and Assessment Service, Seoul, Korea
| | - Sanghyun Kim
- Health Insurance Review and Assessment Service, Seoul, Korea
| |
Collapse
|
18
|
Bai X, Tsiatis AA, O'Brien SM. Doubly-robust estimators of treatment-specific survival distributions in observational studies with stratified sampling. Biometrics 2013; 69:830-9. [PMID: 24117096 DOI: 10.1111/biom.12076] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2012] [Revised: 05/01/2013] [Accepted: 06/01/2013] [Indexed: 11/30/2022]
Abstract
Observational studies are frequently conducted to compare the effects of two treatments on survival. For such studies we must be concerned about confounding; that is, there are covariates that affect both the treatment assignment and the survival distribution. With confounding the usual treatment-specific Kaplan-Meier estimator might be a biased estimator of the underlying treatment-specific survival distribution. This article has two aims. In the first aim we use semiparametric theory to derive a doubly robust estimator of the treatment-specific survival distribution in cases where it is believed that all the potential confounders are captured. In cases where not all potential confounders have been captured one may conduct a substudy using a stratified sampling scheme to capture additional covariates that may account for confounding. The second aim is to derive a doubly-robust estimator for the treatment-specific survival distributions and its variance estimator with such a stratified sampling scheme. Simulation studies are conducted to show consistency and double robustness. These estimators are then applied to the data from the ASCERT study that motivated this research.
Collapse
Affiliation(s)
- Xiaofei Bai
- Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695, U.S.A..
| | | | | |
Collapse
|
19
|
Sjöholm L, Andersson B, Högberg N, Widmark AK, Yuen J. Genotypic diversity and migration patterns of Phytophthora infestans in the Nordic countries. Fungal Biol 2013; 117:722-30. [PMID: 24119411 DOI: 10.1016/j.funbio.2013.08.002] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2012] [Revised: 08/07/2013] [Accepted: 08/08/2013] [Indexed: 10/26/2022]
Abstract
In this study we investigated the genotypic diversity and the migration patterns of Phytophthora infestans in the Nordic countries. Isolates of P. infestans from outbreaks in 43 fields sampled in 2008 were collected using stratified sampling with country, field, and disease foci as the different strata. Microsatellites were used as markers to determine the genotypic variation in the sampled material. The results show a high genotypic variation of P. infestans in the Nordic countries with most of the genotypes found only once among the collected isolates. The major part of the genotypic variation was observed within the fields, with low differentiation between the fields. The observed low association of alleles among loci is consistent with frequent sexual reproduction of P. infestans in the Nordic countries. Coalescence analyses did not support a single common population for the four countries, thus indicating some degree of geographic differentiation. The analyses of migration patterns showed differing levels of gene flow among the Nordic countries. No correlation between migration rates and geographical distance could be seen. This could be explained by different degrees of genetic similarity between the pathogen populations in the different countries.
Collapse
Affiliation(s)
- Lina Sjöholm
- Uppsala BioCenter, Department of Forest Mycology and Plant Pathology, Swedish University of Agricultural Sciences, Box 7026, SE-75007 Uppsala, Sweden
| | | | | | | | | |
Collapse
|