SafeServer Error Rate For First 1,000 .com Domains

Author: Krystle
Comments: 0 · Views: 72 · Posted: 24-06-29 22:09

Of the first 1,000 working .com domains, 44 were blocked by SafeServer. Of these 44 blocked sites, we eliminated 15 that were "Under construction" pages (the list of 15 non-functioning sites is here). Of the remaining 29 sites, 10 were errors (i.e. sites that do not meet SafeServer's criteria) and 19 were non-errors (i.e. sites that do). That is an error rate of 10/29, or 34%: roughly one site blocked incorrectly for every two blocked sites that meet SafeServer's criteria.

SafeServer features iCRT: a leading-edge technology based on artificial intelligence and pattern recognition technologies. The technology is trained to detect English-language pornography. SafeServer evaluates each incoming web page for inappropriate material. If the page is unacceptable, the browser displays a block page explaining why the page was refused and suggesting alternative sites. Thus, unlike most blocking programs, SafeServer does not come with a built-in list of "bad sites"; it examines each site as it is downloaded. Filtering categories include: Hate, Pornography, Gambling, Weapons, Drugs, Job Search and Stock Trading.

There were no blocked sites that we considered to be "borderline" cases, e.g. [...]. The sites in this sample were either pornographic sites (e.g. [...]) or [...]. The list of blocked sites was current as of October 2000. Only the "Hate", "Pornography", "Gambling", "Weapons" and "Drugs" categories were enabled.

We started with zone files from Network Solutions listing all .com domains in alphabetical order. Michael Sims supplied the first 10,000 domains in alphabetical order from that list, after eliminating sites at the top whose names consisted mostly of "-" dashes. We used a script to narrow down the list to the first 1,000 pingable domains, sorted alphabetically by domain name. We used the first 1,000 working domain names in order to make our sample "provably random".
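
The selection step described above can be sketched in a few lines of Python. This is only an illustration, not the script used in the experiment: the function names first_working and resolves are hypothetical, a DNS lookup stands in for a real ping, and skipping every name that begins with "-" is a simplification of the dash filtering described in the text.

```python
import socket
from itertools import islice


def resolves(domain):
    """Stand-in reachability check (assumption): a DNS lookup, not a real ping."""
    try:
        socket.gethostbyname(domain)
        return True
    except OSError:
        return False


def first_working(domains, limit=1000, is_working=resolves):
    """Return the first `limit` working domains in alphabetical order.

    Names starting with "-" are skipped, a simplified version of the
    dash filtering described in the text.
    """
    candidates = sorted(d for d in set(domains) if not d.startswith("-"))
    working = (d for d in candidates if is_working(d))
    # islice stops checking as soon as `limit` working domains are found.
    return list(islice(working, limit))


if __name__ == "__main__":
    # Offline demo with a fake reachability check instead of the network.
    up = {"aaa.com", "aac.com"}
    sample = first_working(
        ["aac.com", "aab.com", "-x.com", "aaa.com"],
        limit=2,
        is_working=lambda d: d in up,
    )
    print(sample)
```

Because the procedure is deterministic (sort, filter, take the first N), anyone holding the same zone file can rerun it and check that the same sample comes out, which is the point of the "provably random" argument.
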
A truly random sample chosen from the entire list of domain names would have been better, but it would be impossible to prove that such a sample had really been chosen randomly; a third party could easily claim that we had "stacked the deck" by choosing a disproportionate number of sites blocked incorrectly by SafeServer.

A sample of 29 "real" sites blocked by SafeServer is too small to support any precise conclusions. The problem with the sample size is that we had to start with 1,000 Web domains just to get a sample of 29 blocked domains. The 34% figure should not be taken as accurate to even two significant figures; across all .com domains in existence, the error rate for SafeServer could be as low as 15%. However, the test does establish that the likelihood of SafeServer having an error rate of, say, less than 1% across all domains is virtually zero.

A note on interpreting these results: the results are not weighted by Web site traffic, so some of the sites in this experiment may cause more "Access Denied" messages than others. The 34% error rate should also not be interpreted to apply across all domains, since we only used .com domains in our experiment, which are more likely to contain commercial pornography than, say, .org domains.

Based on the error rate found in this experiment, we conclude that the overall accuracy rate is low, and that about one third of sites blocked by SafeServer do not meet its criteria.
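
The sample-size caveat can be made concrete with a short calculation. The sketch below is an illustration and not the method used in the report: it assumes the 29 blocked sites behave like independent draws, uses a 95% Wilson score interval (one common choice for small samples), and computes the binomial probability of seeing 10 or more errors in 29 sites if the true error rate were only 1%. The function names are hypothetical.

```python
from math import comb, sqrt


def wilson_interval(errors, n, z=1.96):
    """Wilson score interval for a proportion (z=1.96 gives roughly 95%)."""
    p = errors / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


def binomial_tail(k_min, n, p):
    """P(X >= k_min) for X ~ Binomial(n, p)."""
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(k_min, n + 1))


errors, blocked = 10, 29  # figures from the experiment
rate = errors / blocked
low, high = wilson_interval(errors, blocked)
tail = binomial_tail(errors, blocked, 0.01)

print(f"observed error rate: {rate:.1%}")
print(f"95% Wilson interval: {low:.1%} to {high:.1%}")
print(f"P(10 or more errors in 29 | true rate 1%): {tail:.2e}")
```

Under these assumptions the interval is wide, which is why the 34% figure should be read as a rough estimate, while the tail probability is vanishingly small, in line with the claim that a true error rate below 1% is virtually ruled out.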
