// Adobe Recognizer Framework (ARF) Recognition Process Script // Controls the sequence of Experts called per-page, and per-document. // All parameters are interpreted as signed long integers. // Up to 6 named parameters can be passed to an Expert, separated by commas (no spaces). Globals: log.enable=0 // toggle log file creation log.popup=0 // pop up log on normal doc completion log.verbosity=0 // 0=basics, 1=detailed, 2=maximum detail output.deskew=1 // 0=no deskew, 1=deskew, 2=deskew and save output.multienc=0 // 0=doublebyte encoding, 1=multibyte encoding output.markup=0 // 0=no markup, 1=render markup output.debugFontGroups=0 // 0=no debug font groups, 1=debug font groups output.colorLigs=0 // 1=>make split ligs green and rest red output.maxDewarp=35 // 0=no dewarp, 25= max of rise of 25 pixels per run of 1000 output.maxResidualDeskew=25 // 0=no residual deskew output.dewarpTextRegions=1 output.saveIntPDF=0 // 1=>save intermediate pdf to disk (hardwired to C:\Temp\A.pdf) output.debugOCR=0 // 1=single-image + hidden text for debugging OCR result bboxes output.xmlOutput=0 // 1=save IRIS data structure output.foregroundCompression=-1 // This should be 1 (low quality) to 5 (high quality). It can also be -1 which means use the default behaviour. output.backgroundCompression=-1 // This should be 1 (low quality) to 5 (high quality). It can also be -1 which means use the default behaviour. output.foregroundDownsampling=-1 // This should be 0 (no downsampling), 72, 150, 300, or 600. It can also be -1 which means use the default behaviour. output.backgroundDownsampling=-1 // This should be 0 (no downsampling), 72, 150, 300, or 600. It can also be -1 which means use the default behaviour. output.foregroundLossless=0 // 0=use jpeg, 1=use zip output.bezierFitFonts=1 // 0=no bezier fit fonts, 1=bezier fit fonts output.edgeBleeding=1 // 1=edge bleeding on 0=edge bleeding off debug.useOutputRoot=1 // used in PageSaver and in BackgroundMaker Process_Page_Pass_1: GlyphCleaner (minCoverage=90) TemplateBuilder (maxDist=80) // maxDist is matcher threshold FontRecognizer(maxDist=80) GraphicLineFinder (linetype=3) // 0=none; 1=underline; 2=non-underline; 3=both BaselineFinder (fixLines=1) FontBuilder () RiskAssessor (reportRiskyWords=0) // find risky text for possible revert-to-image: 1=report risky words to the debugger console BackgroundMaker (despeckle=1,vectors=1,dewarp=1,imagelift=1,saveHistograms=0,whitePt=255) // imagelift=1 for IRIS images, imagelift=2 for ARF images LigatureSplitter () SideBearingFinder () BaselineFinder (update=1) // update template baseline data CompositeMaker () // average glyph bmps Idealizer (smoothBaselines=1,maxDiameter=250,snapToBlack=0,midGray=0,snapToGray=0,keepColor=0,flattenBaselines=1) // colors darker than snapToBlack become black and colors lighter than midGray don't change BackgroundMaker (hiQ=1) // PageSaver () // creates page info files in C:\Temp SynthFontCollator () // merge page fonts into doc fonts //Process_Page_Pass_2: // SingletonRejecter () // BackgroundMaker (despeckle=1,vectors=0,dewarp=0) // SanityChecker () Finish_Document: DocFontCombiner (nt=32,mm=3000) // nt: number of templates threshold, mm: max M CompositeSmoother (blurLevel=1) // 0=none, 1=mild, 2=more CompositeThresholder () // convert doc template bmps to bilevel