Adds advanced option to render starting url in webview (now performed on editing / adding starting url) This can help to pass certain DDOS checks.
v12.7.2 April 2023
Fixes an issue causing some missing anchors to not be reported.
When testing anchors, if the anchor fragment of a url isn't found on the target page, link is now coloured red in the views and treated as a bad link, rather than being coloured orange as before.
v12.7.3 April 2023 (Integrity Pro only)
Fixes Images not appearing in the SEO / Images table
Adds double-click to SEO / Images tab for preview. Also context menu with Preview / Copy URL / Open URL
v12.7.2 April 2023
Fixes an issue causing some missing anchors to not be reported.
When testing anchors, if the anchor fragment of a url isn't found on the target page, link is now coloured red in the views and treated as a bad link, rather than being coloured orange as before.
v12.7.1 March 2023
(Plus and Pro)Reinstates 'Export Image XML Sitemap' which was a feature of v10 but has been unintentionally missing from v12 to date
(Plus and Pro)Tidies up the image checking functionality internally. Images must be included in the scan if images are to be included in the sitemap xml or an image sitemap, so image checking is forced on if either of these settings are enabled. To disable image checking (for efficiency with certain operations) then all of these settings need to be switched off
(Pro)Fixes Description Length column of SEO table showing incorrect value
(Plus and Pro)Fixes empty lastmod tags appearing in sitemaps where no date is available. Tags are now correctly left out if no date is available (optional tag). NB Google says, if no date is available, don't insert date of sitemap generation
v12.7.0 February 2023
Some fixes to the 'check anchors' functionality. Note that the status will be the http status regardless of whether the anchor is good or bad, ie '200 no error' even if the anchor doesn't appear on the target page. In the case of a problem with an anchor, a warning is created on that url which can be highlighted in orange depending on your Preferences, or shown in the warnings table in the case of Integrity Pro and Scrutiny.
Fixes the option Preferences>Views>Display labels>'Display labels for redirected URLs'.
An efficiency improvement which may help with a slowing-down problem experienced by some users and will generally make the app more efficient for medium to large sites.
v12.6.2 January 2023
Fixes double-quotes in url (even if percent-encoded) corrupting bad links (by link) csv
Fixes crash which happened under unlikely circumstances
v12.6.2 January 2023
(Pro only)
Fixes pdf documents (if being checked) were being included in some SEO tests that should only apply to html pages, such as missing title / description
The default threshold for the 'too many links' SEO test has been raised to 1000. This is no longer a very relevant test but may still be useful to know if a page has an excessive number of links
Similarly, default keyword density threshold increased to 6% for more relevant results
v12.6.1 January 2023
Fixes a bug that could have caused spurious statuses for certain urls (caused by containing a percent symbol which isn't part of a percent-encoding, which is a bad practice anyway.)
Fixes some urls with unlikely character combinations in the querystring or path to misleadingly display in tables as the domain only
Some social and calendar 'add this' links are now listed but not checked. It's potentially not a good thing to request these urls to check them. They are major domains (eg Google, Outlook) and tend to be generated by plugins, and so more likely to give a false positive when tested than to actually be broken.
v12.6.0 December 2022
Better handling of html5 picture tag
(Plus and Pro) Preferences for Sitemap (whether to include unique image urls, audio, video, pdf) are now all switched on by default
v12.5.3 December 2022
In the 'by link url' table, the starting url may have shown 'appears 0 times' in the On page column if the exact url didn't appear as a link (eg if you start at mysite.com but all 'home' links point to /index.html). Technically true but inelegant. Now shows "starting url"
(Plus and Pro) Restores 'Check for updates' under Application's main menu. If enabled, an auto update check will be performed on startup and display a button in the main window. The auto update can be switched from within the manual update check window.
v12.5.2 November 2022
Fixes a problem that led to the final status of certain urls (a redirect followed by an error) showing the 3xx redirect code rather than the final error code.
Fixes bug with the redirect table in the Link Inspector not showing the starting url on the first row but the first redirect url
Fixes a problem that could (under fairly unlikely circumstances) lead to a page incorrectly being excluded from the sitemap for being marked 'robots noindex'.
v12.5.0 October 2022
Improvements to parsing srcsets
(Pro) Improves structured data functionality:
Now allows for multiple chunks of json-ld on a page
Now reads multiple items within @graph properly
Displays the top-level @types from json-ld in the SEO->Meta data table.
v12.4.7 October 2022
Fixes Preferences>Views>'treat blacklisted urls as bad links'
Fixes appearance of headings 'outline' view in page inspector
Other small fixes
(Plus and Pro) Adds 'Delete configuration' to context menu
Fixes sorting by column in 'by status' view, plus small efficiency related to building by status view at end of scan.
v12.4.5 September 2022
Fixes an issue that could have caused image urls after an audio or video tag to be corrupted and therefore test bad since 12.4.2.
Fixes relating to the 'on finish alert' preference
(Pro) Fixes an issue which could sometimes cause scan to slow down and stop when using the rendering feature
Updates to the manual
v12.4.3 September 2022
Improvement to 'robots noindex' search, false positives may have been seen for that.
fixes a problem that prevented the contents (src) of iFrames from being followed if the starting page consists of iframes and no other links.
Adds support for links to audio and video files within <audio src="..." / <video src="..." It has been reporting and testing <audio><source src="..."> but both markups are valid and now correctly parsed, tested and reported
(Plus and Pro) Adds 'Visit' to context (right-click) menu of website list.
Fixes problem causing css files to not be parsed for url('') images. (These image urls should be tested and reported if 'linked files' and 'images' are switched on in Options.)
v12.4.0 August 2022
Fixes problem which caused starting urls with accented characters in the domain to stall.
Urls with special characters may have appeared encoded rather than decoded in one or two places including the Link Inspector 'appears on' table and redirect table.
Improvements to soft 404 functionality:
certain social networking sites currently return a 'soft 404' which is a 200 code and a page that says "page not found" or similar. Detecting this requires the soft 404 feature to be switched on and configured. in some cases it can also require clientside rendering. External links to Twitter and Youtube are now automatically rendered in order to access and check page content (if soft404 is switched on)
Some terms are added to the default value for the soft 404 terms field. This will only affect new users or those who haven't altered the default list.
When a possible soft 404 is detected, a warning is created which explains which term was matched on the target page. This can be seen in the Link Inspector and in the Warnings table.
Adds context help to soft 404 settings in the Preferences window.
v12.3.16 July 2022
Improves sorting in Link Inspector 'Appears on' table
Adds 'Copy URL' to the context menu for that table
Fixes a problem preventing the 'bad links only' button from working properly in 'by page' and 'by status' views (Integrity Pro and Scrutiny only)
When Pause is pressed during scan, current connections are now stopped and returned to the check queue. Previously they were allowed to finish, which didn't cause a problem, but as the stats crept after pressing the button, it didn't look particularly 'clean'.
v12.3.15 July 2022
By popular request, adds 'live updating' to the 'By link URL' table (and only if that tab is selected).
it is best if the Settings or another tab is selected during the scan, as the live updating is an overhead and makes the san slower and less efficient. To help with this problem, the 'live updating' happens periodically rather than with every url. But this still isn't advised for very large sites.
(Integrity Pro) Fixes message on Spelling tab 'Spelling is disabled in settings' being displayed permanently
v12.3.14 June 2022
Significantly improved and more efficient parsing for meta http refresh. Now checks for delay in seconds within the content attribute, if small (<6s) will observe the redirect, otherwise will ignore.
Now ignores meta http refresh found within <noscript> tags.
Should make the crawl a little more efficient too, as the check for meta http refresh is made for every page parsed.
v12.3.13 June 2022 (Integrity Pro)
Some fixes and enhancements to the spelling dialog and table:
fixes '1 of 15' indicator just above the context field in the spelling dialog
fixes 'learn' button
Adds context menus to the 'Spelling by word' view - allows copying of word and url, visit url.
v12.3.11 June 2022
Important fix for all users: since 12.3.9 certain redirections could cause a crash during the scan.
v12.3.10 June 2022
Important update for Japanese users:
- better detection and proper implementation of EUC-JP character encoding
v12.3.9 June 2022
Implements Page Inspector which opens rather than the Link Inspector where appropriate (eg SEO and Sitemap tables, top level of 'by page' view, or from within Link Inspector if the link url's target is a page.)
Adds 'detailed diagnostic window for starting url', available under View menu
Correctly handles response header field 'Refresh:', performing the refresh if the number of seconds is < 30
Handles http redirect where the redirect url is empty
Handles http redirect to a mailto: tel: etc
Adds experimental preference for the Connection: header field (keep-alive or close).
Adds number of pages scanned to the main scanning status bar
Small fix to soft404 check, it had been necessary to leave the 'terms' field in order to save any changes. Now simply typing is enough.
Minor fix to the linksLimit field
Fixes to timestamp functionality - now the request date/time shows in the Link Inspector
If 'bad links only' was selected when application quit, on restart the application would be filtering 'bad links only' but the button would work in reverse, ie show all when depressed. Now fixed.
v12.3.3 April 2022
Adds ability to fully edit items in rules table (double-click to edit)
was incorrectly disabling the meta data option if the querystring option was switched on. Now fixed.
Fixes problem causing some links to be reported with no status, also resulting in "X of Y links checked" where X is less than Y when finished. This could have happened if Integrity first receives a link to a page which is different to its canonical url and happens to try to crawl that page before discovering that canonical url elsewhere.
fixes data: urls within inline styles from being reported
(Integrity Pro) adds 'Deep content' to the list of SEO filters (Threshold can be set in Preferences, default is 6 clicks from home.)
v12.3.2 April 2022
Fixes couple of problems with sitemap rules for setting priority/change frequency
Fixes a problem which may have caused extra urls to appear in the sitemap / SEO table. This is if internal links on the site redirect to a different URL (which isn't ideal anyway) the original link url was being added to sitemap rather than the final destination url.
Adds context menu to SEO table view with some useful functions
switches off some debug messages which would have affected performance of crawl when archive feature switched on.
cleans up some warnings. some data: image srcs were being incorrectly included in the warnings, marked as having no alt text. Also cleans up the formatting, a spurious number was seen following the line number.
Fixes sorting by status in link URL, By Page and All Links views.
v12.3.1 (unreleased)
Fixes issue with "x of y links checked" not matching at the end of the scan.
Efficiency improvements
v12.3.0 April 2022
Fixes problem some have experienced with larger sites: "The operation couldn't be completed. (NSPOSIXErrorDomain error 24 - Too many open files)"
Fixes 'soft 404' check
Adds an option to soft 404 check, allows you to limit the check to internal pages only (for the best results if set up properly), external only (can produce many false positives and false negatives due to the nature of soft 404s) or both.
v12.2.3 April 2022
Fixes bug, related to recurring redirect, that could have caused a hang or crash
Recurring redirects now correctly show as bad links
Fixes bug that wouldn't have caused a noticeable problem but because it caused external urls to fully load unnecessarily, the crawl may now be noticeably faster.
v12.2.2 April 2022
Defines 'recurring redirect' to be more than 12 redirects. (Greater than 3 already signalled 'redirect chain' in SEO results.)
Fixes bug which could cause 'running on' after crawl has apparently finished. Or in some cases a crash at that point. (related to meta refresh repeatedly redirecting)
v12.2.1 April 2022
Fixes bug that might have caused the crawl of some Wordpress sites to stall at the first page.
Fixes bug that may have caused crawl to stall at first page if starting url redirected to a different domain or maybe even the www of the same domain.
Improves the handling of redirects. Under certain circumstances, the information in the redirect table and the url showing in the link results tables may have been confusing.
Tidies up a few things in the interface
v12.2.0 March 2022
Fixes bug that could cause pages to not be fully scanned under certain circumstances
Fixes 'empty quote' / placeholder flagging
Fixes problems in 'by page' view after clicking header to sort by url
Fix that prevents a possible (but very unlikely) crash
Better handling of scan finishing, less chance of appearing to stick near or at the end of the scan
Handles HTTP Basic authentication protection space (as defined in RFC7617)
Alters default setting for request header field 'Connection' from 'close' to 'keep-alive'. Examples seen where 'close' causes 'Network connection lost' status.
Other small improvements
v12.1.1 March 2022
Fixes crash which may have been experienced when exporting links and choosing 'by link'.
Improves certificate authentication for client-certificate protection space - previously the scan may have appeared to hang at the end because of this.
Corrects an issue with the timeout field, which may have been incorrectly set to a low value when creating a new website config, perhaps resulting in some unexpected timeouts.
v12.1.0 March 2022
Slight change in behaviour. After pausing, views are populated, so partial crawl can be examined. Or work can start on a bit of fixing after a pause/continue.
'Stop at X links' and 'Crawl maximum X clicks from home' are moved from Preferences to site-specific settings (Main window, Rules tab). This is also a fix as these settings weren't working properly in v12.0. This is a useful way to limit the crawl if it's not possible to limit a crawl by blacklisting.
Change to the robots.txt policy. With 'limit crawl based on robots.txt' switched on, if there is a conflict, ie the same url is allowed and disallowed, then 'disallow' overrides.
Adds an information box which is triggered if the scan stalls at the first url.
If main window is closed, but application remains open, a click on the dock icon re-opens the main window.
Corrects small glitch with the SEO table, preventing full display of the bottom row.
v12.0.5 March 2022
Fixes 'mark as fixed' and 'recheck this url' within link inspector window. Also fixes some possible refreshing issues if those actions used from context menus
Fixes bug causing "ignore" rules to not be saved
Adds "File>Return settings to default" menu option. Simple but useful, particularly with the free Integrity (which doesn't allow you to create new/multiple website configs) and particularly useful in support situations (Most support issues have to do with a setting that needs to be changed, or more usually that has been changed that doesn't need to be).
v12.0.4 February 2022
Version 12 becomes the general release. Phased, beginning with Integrity Pro, web download.
v12.0.3 (beta) February 2022
Updates relating to requests blocked by Incapsula and Cloudflare
Fixes 'display labels for redirected urls' and display labels for warnings' (label means the orange colouring in the links views.) These settings appear in Preferences > Views
default setting for labeling (colouring) urls with warnings is now off by default.
More elegant handling of Delete Configuration. Previously, when a config deleted, the next was selected, without taking folder selection into account, so the user could 'jump' to an apparently random config in another folder. Now, the next config in the current folder is selected if possible.
v12.0.2 (beta) January 2022
Adds context menus to the links views, containing useful functions like mark as fixed, highlight and locate.
Many other small fixes.
v12.0.1 (beta) January 2022
Small but important fix. If a self-closing style tag found on the page (unlikely but valid) parser would ignore the rest of the page.
v12.0 (skipping v11) (beta) January 2022
Integrity Pro beta released for testing:
Rewritten engine, replaces deprecated APIs such as NSURLConnection (now NSURLSession)
The deep rewrite allows for other long-standing issues to be properly dealt with
Improved archiving, incorporates the work done in WebArch and Website Watchman
Interface brought up-to-date. Integrity Pro and Plus will have better searching of website configurations and organisation in folders
v10.4.13 January 2022
Fixes the Locate function which appears in the Links context menus (with one instance selected). This is a powerful and useful feature, often overlooked.
Other small updates related to Cloudflare and Incapsula blocking.
v10.4.11 January 2022
Small but important fix. If a self-closing style tag found on the page (unlikely but valid) parser would ignore the rest of the page.
v10.4.10 December 2021
Further small fix to 'starting with a list of urls' functionality. This concerns a list of urls with different domains, where one or more have paths. The 'down but not up' rule should work for any with paths and the 'this page only' checkbox should control whether only the links in the list are checked or whether each is crawled.
Note that this feature will only work fully and properly with the non-sandboxed (ie non-app store) versions of Integrity Plus and Pro. The next major version will be all-sandboxed and also contain the necessary measures to allow local lists and local sites to be checked.
v10.4.9 December 2021
Enhancement to starting with a list of links. It's been possible to make a list of urls to different domains in order to scan multiple sites in one scan / one set of results. Now the 'down but not up' rule is applied to urls in that list, so it's possible to selectively crawl sections of a single site.
(It is also possible to do this by setting up 'whitelist' rules, but this relies on there being links on your starting url to the areas that you want to scan.)
Note that when using the list of deep links, the trailing slash is important. A url such as peacockmedia.software/mac/scrutiny will be assumed to be a page called scrutiny and the crawl will be limited to /mac/. But a url such as peacockmedia.software/mac/scrutiny/ is assumed to be a directory and the scan will be limited to /scrutiny/
Improvements to the parsing of image srcsets
v10.4.8 November 2021
If a black/whitelist term contained a trailing slash, such as '/mac/scrutiny/', a url such as peacockmedia.software/mac/scrutiny would have failed to match because of the trailing slash and a strict string match. Now this situation is taken into account and such a match works as you would reasonably expect.
Doesn't check links to certain Wordpress files such as wlwmanifest.xml and gmpg.org/xfn/11 (along with xmlrpc.png which it was doing previously)
v10.4.7 November 2021
Integrity and Integrity Plus were incorrectly showing File > Export > Warnings, which would fail with a warning bell if chosen.
Integrity Pro would fail to export Warnings or Spellings if File > Export was used and the table in question had not been accessed in the UI first.
v10.4.5 October 2021
Some fixes to the config functionality, in particular, deleting the last config in the list, which would cause unexpected behaviour.
Now handles cookies by default. It's becoming more important and the original reason for having it off by default is less of an issue now.
For each request, the request header field Cache-Control is now set to no-cache (rather than max-age=0) which may be the better way to force a fresh version of the page
v10.4.4 September 2021
Important update for users of version 10.4.2+, If head tags exist but not body tags (which we believe is fine as both are now optional) Integrity would fail to find links on the page.
v10.4.3 September 2021
Important update for users of 10.4.1. fixes issue with the link inspector, not visiting / highlighting / locating the correct / selected page
Fixes issue where multiple <head> sections would prevent proper parsing of some of the information in the head and could lead to incorrect warnings of missing title or missing description
very minor tweaks to the server request header fields which are sent with every request
if image url is empty, alt text warning says "empty" for image url rather than being blank
NB Integrity and Scrutiny support pages with multiple head sections (with warning), no head tags, no head or body tags.
Minor correction with one of the warnings - p within heading - the warning said that p can only contain inline content (which is true) but in this case it should have said heading tags can only contain inline content.
v10.4.1 August 2021
Fixes problem - multiple instances of the same page may have appeared in the 'Appears on' list in the link inspector, if the anchor features was turned on.
(Pro) Adds ability to see context for warnings. Sometimes it can be difficult to find the problem in a page, even given a line number. A double-click on a warning in the warnings table will open an inspector which will usually show a clip from the page source in the area of the problem.
v10.4.0 August 2021
(Pro) Adds Headings table to SEO results (headings are still available in the main view)
Interface changes, for user-friendliness - in Links results, 'By Link' is now 'Link URLs' and 'Flat view' is 'All Links'.
Very minor improvements to content-type / file type detection.
(Pro) A few small fixes to the default values of the SEO results tables
(Pro) Fixes formatting of data in SEO headings columns (were unnecessarily padded with tabs)
v10.3.18 August 2021
Now always sends the Accept-Language header with the default value of '*' for all requests.
Adds field to Preferences for user to add a custom value for the Accept-Language header, in order to control which language is selected, for websites which use this header to select which localisation of the website is served.
Minor improvements to content-type / file type detection.
v10.3.17 July 2021
Improvements to robots.txt detection, parsing and applying
includes bug fix, 'disallow' terms were being incorrectly applied to externally-hosted resources
includes bug fix, disallow was working on the whole path, rather than the root, so if /reports/ in disallow list, would apply to domain.com/xxx/reports
Adds HTML validation warnings "more than one title tag found" and "more than one opening html tag found"
(Integrity Pro) Improves the html validation where 'render page (run js) is switched on (not recommended unless absolutely necessary). Previously there would have been some false positives (eg 'no doctype' when a doctype is present), some warnings would have been masked, and line numbers would have been inaccurate, because the html would have been parsed after the page render. Now an additional pass for warnings is made over the pre-rendered source.
v10.3.15 July 2021
Fixes possible false positive for the warning "...contains an = in an unquoted value..."
Adds warning where more than one meta description is found in the head.
(Integrity Pro) Fixes possible crash at end of scan
v10.3.14 June 2021
Improvement to robots.txt check. Now page status of /robots.txt is checked before parsing the page. Previously possible to parse a custom 404 page. This was unlikely to cause any issues as allow/disallow wouldn't have been found in the document but it would have taken time to parse if the 404 page was long.
v10.3.10 June 2021
Not an important update for users of 10.3.9.
Adds a diagnostic tool which could be useful if crawl stalls at the first page.
Adds much more informative About box to Plus and Pro versions. Includes selectable and copyable licence key and licensed email address, link button to app's home page.
v10.3.9 May 2021
Important fix for all users: fixes issue where crawl would stall if the starting url contains a meta refresh without a url, ie just a refresh which isn't a redirection to another url. This might have also prevented pages within the site from being crawled properly if they contained such a meta refresh, but this is likely to have gone unnoticed.
Reduces the height of the preferences window, was previously too high for some screens.
Fixes false positives reported where an srcset has a hanging comma (which is failed by the w3c validator as "empty image-candidate string").
The above situation is reported in the warnings. Integrity and Integrity Plus show these in the Link Inspector, Integrity Pro and Scrutiny show html warnings in a table.
v10.3.5 April 2021
Some improvements relating to images:
Image urls within <picture><source srcset=.... were being collected and checked even when 'check images' was switched off.
Images with querystring after the file extension were not being recognised as images under certain circumstances (eg if they had a bad status, or if no mime type is included in the response header)
Some improvements to checking list of links / local files: (web download version. Not supported with MAS version due to Apple's sandboxing requirement)
Enables files stored in certain locations outside the user directory.
Fixes problem with case sensitivity check when a file location involves a symlink
Handles certain trackback links, no longer reports them as bad links.
v10.3.2 March 2021
Fixes problem where under unlikely circumstances, spurious character(s) find their way into an image or link url, causing bad link to be reported.
Important fix - link urls within html area map were incorrectly being marked as images, which in Plus and Pro could prevent the page from appearing in the sitemap if the image map is the first occurrence of that url that Integrity discovered.
v10.3.1 March 2021
Adds 'trust invalid server certificate' (internal domain / subdomains only). Allows scanning of site while certificate is out of date or not yet installed properly.
(Pro only) Fixes bug which cause pages to be incorrectly reported as having 'robots nofollow'. (occurred when another page which genuinely is robots nofollow links back to the first page).
v10.2.1 February 2021
Fixes problem with 'flag blacklisted' option. Blacklisted url (ie 'do not check links containing...') were not being flagged properly. The option is now renamed "Treat blacklisted urls as bad links" and with that option switched on, those urls now show up when filtering 'bad links only'.
Integrity free and Plus released as v10.2.1 for consistent numbering and in order to gain some of the general fixes and enhancements that have been released in Integrity Pro since v10. They do not contain the html validation functionality which is the major part of version 10 of these applications.
Fixes filter button and sorting on new Warnings table.
v10.1.0 January 2021 (Integrity Pro only)
Adds options in Warnings table context menu to "Hide warnings like this" either temporarily or until next scan.
Improves parsing of robots.txt:
'allow' now correctly overrides 'disallow'
correctly handles pattern matching with * and $
Fixes filter button and sorting on new Warnings table.
v10.0.5 January 2021 (Integrity Pro only)
Some tweaks to the new html validation functionality
With the 'checked linked js and css' option switched on, Integrity already searched css files for image urls. But not if the 'single page' option was switched on. Now it does.
v10.0.4 January 2021 (Integrity Pro only)
Version 10 adds HTML Validation and other fixes and enhancements
html validation of all pages during scan. A list of warnings (mostly html validation but also accessibility and other warnings) is available from the Results selection screen. It's sortable, exportable and filterable (html validation / links / server errors).
Other new features:
Improves spell-checking, filtering is improved to take out filenames and some other non-text that can legitimately appear in the page content.
Makes the 'highlight' feature within the link inspector a little more robust.
When needing to identify a file type, the mime type (returned in the 'content-type' header field) if it exists, takes precedent over the file extension. There are rare scenarios where this makes a difference.
Full list of possible html validation warnings (so far):
unclosed div, p
extra closing div, p
extra closing a
p within h1/h2...h6
h1/h2...h6 within p
more than one doctype / body
no doctype / html / body /
no closing body / html
unterminated / nested link tag
script tag left unclosed
comment left unclosed
end p with open span
block level element XXX cannot be within inline element XXX (currently limited to div/footer/header/nav/p within a/script/span but will be expanded to recognise more elements )
'=' within unquoted src or href url
image without alt text. (This is an accessibility, html validation and SEO issue. The full list of images without alt text can also be found in Scrutiny's SEO results.)
more than one canonical
Warnings that are not html validation:
The server has returned 429 and asked us to retry after a delay of x seconds (a number of these indicates that you need to rate-limit your scan )
(if 'check anchors' is switched on) a link contains an anchor which hasn't been found on the target page
The page's canonical url is disallowed by robots.txt
link url is disallowed by robots.txt
The link url is a relative link with too many '../' which technically takes the url above the root domain.
(if 'flag blacklisted' option switched on) The link url is blacklisted by a blacklist / whitelist rule. (default is off) With this option on, the link is coloured red in the link views, even if warnings are totally disabled.
v9.14.3 December 2020
General Enhancements and updates
finds image url in <meta property="url" content="xxxx">, when either lazyload or look in meta tags is switched on
As a policy, now reports but doesn't test certain urls such as xmlrpc.php and about:blank. They won't appear in Warnings, but will be listed as "not checked" so that the webmaster can see that they exist on the page. Checking these urls isn't helpful. They may exist for perfectly legitimate reasons such as part of a lazyload system or pingback system.
v9.14.2 December 2020
(Pro) Adds a 'Warnings' table. Rather than having to find warnings via the links results by opening the link's inspector for each orange link, Warnings now has its own table. This shows a sortable and exportable table, which lists urls/warnings. Warnings shows certain html validation warnings and this functionality will be expanded in the near future.
(Pro) Fixes problem where spell-checker incorrectly attempts to spell-check an m4r file (if the server doesn't return a mime type in its header fields) and puts garbage in the spell check results
Updates the selectable user-agent strings and adds more (in particular, Edge and some more mobile browsers)
v9.13.3 November 2020
Now recognises and warns about unterminated or nested link tags, which are illegal in html. Previously if this problem existed on a page, it could cause some spurious minor symptoms such as a link url being incorrectly reported as an image
(Plus and Pro)Updates the Paddle licensing framework to the latest version which is Big Sur and M1 compatible
Extends recent quote escaping enhancement to link exports
v9.13.2 October 2020
(All of these updates relate to Integrity Plus and Pro but not the free Integrity)
Fix: When sitemap was exported to csv, double-quotes in any field (eg title) weren't being escaped properly
Adds preference for handling quotes in data when exported to csv. There are at a couple of ways to do this and neither works universally. Integrity's default way (replacing the double-quotes with single) isn't ideal but it should work whatever is used to open the csv. Integrity now offers the choice of all three methods.
Fix: when ftp'ing sitemap, 'ftp://' would be prepended to the server name/address whether that was needed or not.
v9.12.4 October 2020
Fixes bug preventing the crawl from starting after a local list of links has been opened
Adds an efficiency which helps when scanning a very long list of links (csv, xml or txt, thousands of links). Previously it might have appeared that Integrity would hang for some time before the scan started running
v9.12.3 October 2020
Fixes possible crash which may have happened at any point in a scan for certain websites since v9.10.0
v9.12.2 September 2020
Fixes problem where, if starting url returned a page with no links, crawl was stalling. Integrity was returning a status for the starting url but not checking images or linked files. The page wouldn't be listed in the SEO table (Pro) and it wasn't possible to generate a sitemap. (Pro and Plus)
v9.12.0 September 2020
Improvements concerning robots.txt and warnings:
small fix with parsing the robots.txt file
always parses robots.txt if present and for each url, notes whether the url is allowed or disallowed. If disallowed, a note is made in the url's warnings (warnings are highlighted in orange in the links tables, the actual warnings can be seen in the link inspector)
Adds 'limit crawl based on robots.txt' setting. (Plus and Pro) Whether disallowed pages are included in the sitemap is decided by Preferences>Sitemap>Observe robots.txt
(Pro )adds 'Disallowed by robots.txt' choice in filter button in SEO
(Pro )adds 'Multiple H1' and 'No H1' choices in filter button in SEO, plus they will appear if appropriate in the short summary above the SEO table.
fixes problem with 'warnings' filter option in links 'by status' view
v9.10.0 September 2020 (unreleased)
Improvements to image discovery; now finds and processes image urls within inline styles (if 'check images' is switched on)
Improvements to engine. removes possible duplication of link occurrences under certain circumstances. A few other small changes. The results from this new version may be slightly different but should be more accurate.
Fixes image urls found within style sheet being reported even if 'check images' was turned off.
Correctly ignores sms: links
Jumps in version number are to re-sync with Scrutiny's version number which has had some Scrutiny-specific updates.
v9.8.5 August 2020
When testing linked files, now automatically ignores the wordpress rest api files which return an unauthorised status when tested, leading to unnecessary concern.
v9.8.4 August 2020
Adds support for charset=GBK, charset=koi8-r, charset=euc-kr and some other Latin and non-Latin character encodings. For certain websites using these encodings, page titles and certain other information may have been garbled before.
Some improvements around starting your scan with a list of links. In particular, automatically differentiating between txt and csv file types (this fixes a bug where a url containing a comma within a txt file would be incorrectly split).
v9.8.2 July 2020
Fixes a couple of situations that could result in incorrectly-constructed link urls and therefore false positives.
Better handling of escaped forward slashes in urls.
v9.8.1 June 2020
The jump in version number is for consistency with Scrutiny, although many of the changes in Scrutiny 9.8 are Scrutiny-specific (relating to insecure content checks). Integrity benefits from the following changes:
Adds option to search certain meta tags for urls. Those urls will be link-checked and also checked to see whether they count as insecure / mixed content. The meta tags in question are meta name=, meta itemprop= and meta property=. This includes social media tags such as meta property=og:image
Very small fix to prevent some false positives arising from SVG masks in style sheets
v9.7.1 May 2020
(Integrity Pro) Adds 'Manage custom dictionary' button above spell-check table. This tool provides an easy way to see your list of 'learned' words (to check that you haven't 'learned' any misspelled words)and 'unlearn' any that you learned by mistake.
Diagnosis feature: If debug console verbosity is switched to 'ridiculous', the html received from the starting url is printed to the debug console.
Fixes blacklist rules table sometimes not clearing when user creates a new website config
Adds 'Links in' and 'Links out' tables to the page inspector (accessed via the 'target page' tab of the link inspector)
v9.6.9 May 2020
Fixes spurious javascript being incorrectly reported as a link in certain situations
Fixes relative links being constructed incorrectly where page being parsed is a directory url
In the warnings tab of the link inspector, if there was a warning about a redirection, it may have contained the final url twice instead of the original url and final url.
v9.6.6 May 2020
(Plus and Pro) Correctly makes sure that pages are excluded from the sitemap if the url didn't return a good status
v9.6.5 May 2020
Important release for all users. Eliminates some spurious 'bad links' by correctly ignoring <link rel = dns-prefetch / preconnect ... > which often doesn't contain a full resource url and can return a bad or unexpected status when tested.
v9.6.4 (Pro only) April 2020
Fixes a problem with the 'run JS' feature, plus a small efficiency enhancement to that feature.
v9.6.3 (Plus and Pro) April 2020
Fixes bug which could prevent the xml sitemap save dialog from appearing for new users.
For a brand new user who starts by entering a url and running without creating a new config, the 'anchors' setting is now correctly off by default
Minimum system requirement increased to MacOS 10.10 (Yosemite). Users of 10.9 should use version 9.6.1
Some changes to licensing functionality; a fairly major update to the Paddle licensing framework and Integrity's program flow at startup, but should be invisible to the user.
v9.6.1 (Plus and Pro) April 2020
Improvements to exporting functionality:
efficiencies (memory / speed) which will benefit users with larger sites
the odd bug fix, cures a possible hang when exporting
v9.5.7 March 2020
Adds new columns rel = sponsored and rel = ugc to 'by status' and 'by page' views.
v9.5.6 March 2020
Adds sortable columns to links views and link inspector for rel = sponsored and rel = ugc. These columns are hidden by default but can be shown using the 'columns' selector above each of those views..
v9.5.5 March 2020
With the new 'check anchors' switched on, urls with #anchor fragments were sometimes incorrectly appearing in the Sitemap and SEO tables.
Fixes urls being duplicated in Sitemap table under certain circumstances and settings.
v9.5.4 February 2020
Fixes bug causing redirect to not be reported if the reason for the redirect is only to add or remove a trailing slash, and 'ignore trailing slash' option is switched off.
v9.5.3 February 2020
Very important fix to the new anchor checkbox. If left on and greyed out by switching on the querystring checkbox, could cause infinite loop in the scan.
v9.5.2 February 2020
(Plus and Pro)
Small but important fix; csv export wasn't properly escaping quotes in link text
min OS is now officially 10.9 for Plus and Pro. Integrity remains 10.8
v9.5.1 February 2020
Fixes issue with new anchor feature. If an external link contained an anchor and appeared multiple times, each instance was listed separately in the 'by link' view.
v9.5 February 2020
Adds ability to test anchors. You can switch the option on using a new checkbox on Integrity's first tab.
this will cause urls like /index.html#top and /index.html#bottom to be reported as separate links (resulting in more data) and tested separately. (more cpu and time for crawl)
If a link url has a #fragment then Integrity will report the server response code as before (coloured red if status is bad). The anchor has no bearing on this. However, if the status is good, then Integrity makes a further check to see whether a name or id can be found on the target page matching the link fragment. If not, this is added to the link's warnings, and the link will be marked orange.
You can view the details of the warning in the Link Inspector
Note that the anchor check is case-sensitive. Officially anchors are case-sensitive. Some browsers may treat anchors as case-insensitive, but this doesn't mean that all browsers will and it doesn't mean that it's right.
Note that you can't 'ignore querystrings' and also test the anchors, since the anchor fragment comes after the querystring.
The filter button contains a new item 'Warnings' which shows only links with warnings, this will include links with anchors where the anchor (a name or an id) can't be found on the page
As far as the filter button is concerned, 'Warnings' doesn't include redirects, even though they're both coloured orange in the interface and the Link Inspector Warnings tab does include warnings. The Filter button allows you to separate them
The filter button option 'Redirects' will still show redirects, even if you've chosen 'do not report redirects' in Preferences.
Typing a '#' into the search field will show links which contain a #fragment (Plus and Pro only)
Warnings (which have been reported in the link inspector since v9.0) now cause the link to be coloured orange in the views. As some people like to work towards a clean set of results and may not consider the warnings important, the colouring of warnings can be switched off in Preferences > Links > Warnings. The 'Warnings' filter will still work when colouring of warnings is switched off in Preferences.
9.5 also fixes
garbage urls caused by a url containing a comma, or a data: image within an srcset.
fixes bug that's unlikely to have been noticed. If a url redirects and the redirect url has a # fragment, traditionally the rule is that those fragments are just trimmed. But they weren't being trimmed for redirect urls. That is now fixed, but of course the new preference to not ignore anchors is respected.
v9.4.3 January 2020
(Integrity Pro) Fixes bug that could cause scan to stall at the starting url if the starting url redirects and if page rendering is switched on.
v9.4.2 January 2020
Small change to the way the 'rules' work. They are no longer applied to the starting url. The previous behaviour is unlikely to have caused a problem in many cases, and it has worked like that for many years. The new behaviour is likely to be helpful in some cases and unlikely to be unhelpful to anyone.
v9.4.1 January 2020
Irons out problem causing links to be marked external if the case of the domain of a link doesn't match the starting domain. ie start at foo.com, a link to FOO.com would be incorrectly marked as external
Fixes line number column of 'appears on' table within link inspector window
Small fix - unquoted link hrefs with no character before the closing bracket weren't being logged properly, leading to some spurious results
v9.3.6 January 2020
If a meta http-refresh type redirect redirects from an internal url to an external one, then the link was being left marked as an 'internal' link. It's arguable whether this type of link (which redirects from internal url to external) is an internal or external link, but it's important for certain internal processes that it's marked as external when the redirection occurs. This was happening properly for the more usual types of redirect.
(Integrity Pro) adds 'render page / run javascript' feature. A site shouldn't prevent a user from fully navigating a site when js is not used, but if it does, this feature will allow Integrity to crawl it.
v9.3.5 November 2019
(Integrity Plus and Pro) Important fix for anyone who needs to export to csv, html or xml sitemap. Fixes crash which may have been experienced on recent versions of the OS after OKing file save dialog.
v9.3.4 November 2019
Better handling of situation where image urls are being checked and an image with alt text is within a regular a href link which also has some link text appearing after the image and within the link. The link is now correctly reported with the link text and the image url is correctly reported with its alt text
Fixes a bug causing certain links in the above situation to be missed (ie where there is an image beside the link text within a link) and where the new 'lazy load' feature is switched on
v9.3.2 October 2019
Small improvement to 'lazy loaded' image finder. Now finds video and audio urls in the source tag / data-src element
v9.3.1 October 2019
Fixes issue that would prevent Integrity from running under certain circumstances, ie on older systems (MacOS10.13 or earlier) and where the server can serve content using Brotli compression.
Integrity users on MacOS 10.13 or earlier should download this update. It shouldn't make any difference for users 10.14 or higher.
v9.3.0 October 2019
The main tables now retain their selection when sorted, as expected
Support button added to diagnostics window which shows if unexpectedly few results are found
If 492 codes are encountered (too many requests) more information is given in the Link Inspector's Warnings tab. A 429 may come with a 'retry after' which Scrutiny honours. It may also provide some information in the html of the page which follows the 429 code. All of this information is sent to that link's warnings for the user to see.
v9.2.0 September 2019
Fixes a bug causing bad links to be reported incorrectly when the link contains a fragment (#something) as well as non-ascii characters in the link
If a mobile user-agent string for a mobile browser is being used, some sites generate an 'intent://' url. Integrity no longer reports 'unsupported url' for such links.
Disables tabbing mode (View > Tab bar) which was causing confusion if accidentally switched on. (Integrity isn't document-based).
v9.1.1 September 2019
Improvement to 'lazy loaded' image functionality. Adds Blocs to the supported systems.
Adds .webp to the list of recognised image extensions (used in various places within Integrity)
v9.1.0 August 2019
Adds option to look for 'lazy loaded' image urls. There are various ways to implement lazy loading but Scrutiny should find them in the case of the most common implementations.
If a meta http refresh is within comments (including <!--[if lte IE 9]> ... <![endif]-->) then it's now correctly ignored.
v9.0.12 August 2019
Fixes small bug that was preventing the app from running on Catalina
v9.0.11 released August 2019
Adds 'line number' to link instances (the line number of the link within the html file) - there's now a column to show this number in the 'by link' view (when urls are expanded), by status, links flat view and the table within the link inspector.
Fixes bug that was causing broken images to not be shown in links view when Filter button was set to Images. The same bug may have had other symptoms too relating to broken images. (Plus and Pro)
Fixes possible problem of some repetition in the 'columns' selector of certain tables.
Fixes problem with 'Target Page Inspector' button within Link Inspector window when the Link inspector was opened from certain views
v9.0.7 released August 2019
Fixes bug with subdomain option which could cause certain external links to be incorrectly marked as internal.
v9.0.6 released July 2019
Fixes bug with word count (Integrity Pro only)
v9.0.5 released July 2019
fixes links incorrectly reported broken (link is reported with extra text or another url tacked onto the end) when the href isn't terminated by quotes or a space but the end angle bracket.
adds 're-check parent page of url' to context menu in 'links by status' view.
some fixes to the rechecking functionality when called from the By Status view.
v9.0.4 released July 2019
Version 9 adds:
Redesigned lInk inspector
puts redirects on a separate tab rather than a pop-up window
adds warnings tab, contains details of anything that gives this link an orange 'warning' status
traditionally the orange 'warning' status meant redirect(s) but now can include a number of other things
adds 'target page' tab, which shows certain target page properties and a button to access Page inspector
Page inspector
adds sortable tables of inbound links and outbound links
adds download time and mime type to page inspector
Adds detection of unclosed comment tag and unclosed script tag, these things are included in 'Warnings'. In future the number of possible things that you can be warned about will grow.
Adds Warnings into diagnostics window.
Change to the internal flow. Previously link urls were stored 'unencoded' and 're-encoded' for testing (unicode characters and reserved / unsafe ascii characters). This is fine 99.9% of the time but sometimes this can cause a problem when this unencode/re-encode cycle produces a different result form the url as it originally appeared on the page, and the server doesn't respond to the changed version. This can cause Integrity/Scrutiny to report 404 for a link which works on the page.
Internal note: entities are still unescaped (") we consider that part of the encoding of the html page
Link text now searched when using search box and by page view
Redirect chains included in warnings
Better handling of redirection from a http or https url to a tel:, mailto: etc. Does not create a warning but cancels the connection and sets the status to 'not checked'. The redirect details can be seen within the link inspector.
v8.3.15 released May 2019
Patches bug which could have caused the odd link url to be missed or a spurious link url reported if certain unlikely code appears in the page
v8.3.13 released May 2019
Fixes bug which was causing urls to be reported bad where they were found as the src of certain tags (iFrame, Embed, Script) and were not quoted.
Fixes some unexpected urls appearing in Link views when the search box is used
Improvement to subdomain comparison, internal links with subdomains may have been considered external if the starting url had a non-www subdomain. (This all depends on the 'consider subdomains internal' option switched on)
v8.3.7 released April 2019
Fixes fatal error if option to check linked files is switched on and if a css file doesn't answer UTF-8 encoding
v8.3.4 released March 2019
Adds context menu to table within link inspector. Contains Visit, Highlight, Locate (as per the buttons below, which work if you first select a page within the table)
Engine now correctly ignores 'data-' elements within link tags. This was leading to some spurious results
Further improvements to soft '404 functionality'. If target of link returns plain text rather than formatted html, Integrity now handles this. If the target page is formatted html and has a title, this is also now searched for the list of soft 404 terms.
Further small fix for a potential problem to pattern matching (as used in site search, blacklisting soft 404 etc)
v8.3.2 released March 2019
Fixes a bug causing the crawl to stall under obscure circumstances (starting the scan at a deep url, where the deep url contains an asterisk character.)
Fixes problem of 'soft 404' search returning 'near matches'. It now searches literally for the string(s) you enter.
v8.2.5 (app store release only) released March 2019
Improvements to saving sitemap xml:
better error handling and reporting
when large sitemap is broken into multiple files, these are saved into a new folder at the location that the user chooses
option added to prevent splitting of large XML file (There isn't a switch in the interface but must be set using the Terminal.)
canonical url (if pointing to a different page than the page it appears on) has always been collected and shown in the SEO table, now they are also shown (if they point to a different page) as a link instance in the links results tables.
v8.2.3 released February 2019
Corrects odd behaviour when a canonical tag appears twice on a page. This situation is handled more gracefully.
v8.2.2 released February 2019
(Plus and Pro) Minor improvements to 'check for updates' functionality.
v8.2.1 released January 2019
(Plus and Pro) Fixes problem with save dialogue not showing up when exporting certain tables.
v8.2.0 released January 2019
Able to pull image urls from css style sheets and check their status (if the 'check linked js and css files' option is switched on')
(Integrity Pro) Fixes bug causing some code to appear in stripped plain text if
tags have no whitespace between - this could cause spurious words to appear in the spellcheck
Important fix, a bug could cause crash during scan in certain circumstances (though not reported many times). This was also causing some inefficiency
Integrity, Integrity Plus and Integrity Pro are now notarized by Apple (security checked and certified). This requires that they run under 'hardened runtime' which is also a security measure.
v8.1.21 released January 2019
Search box for link results is now a literal full match
Subtle improvement to html parsing relating to comments
Better handling of SSI where the include happens within an html tag
Some engine improvements re extracting canonical url
v8.1.20 released December 2018
Small fix that can prevent a loop in unlikely circumstances with certain options switched on - a 404 page containing a meta-http refresh.
Some updates to the French localization
v8.1.19 released November 2018
Improvement to subdomain handling. The subdomain option 'treat subdomains of starting url as internal' may have not worked as expected if the starting url had a subdomain already, including www. This option should now work as expected for starting urls that include www.
(Integrity Plus and Pro) Fixes a bug with the sitemap csv export which could cause some unexpected urls in the results (no problem with the xml or other formats)
v8.1.18 released November 2018
(Integrity Pro) 'fixes' link count in SEO table. It has always been a simple count of all urls appearing on the page in question. This now includes urls such as embedded audio/video, linked files and image urls (if you're including those things in the link check). The expectation for the 'Link count' column is that it gives the number of <a href links only. Now it does.
v8.1.16 released November 2018
Fixes a couple of problems that could cause the scan to speed up above the limit set in Settings : Timeout and Delays
Change to that Limit Requests to X per minute' setting - it had originally been set to reject anything below 30. That's now reduced to 10 as some sites are getting more difficult to scan with various ways of detecting automated requests.
Fixes bug relating to the blacklist / whitelist rule table, specifically when editing a value, and removes the option for 'Only follow' which was logically flawed and should have been removed when the 'does not contain' option was added. Users should use 'do not follow urls that don't contain' instead.
v8.1.15 released October 2018
improves iFrame support
Fixes problem with img alt text being truncated if it contains a single quote character
8.1.14
Released October 2018
Important patch, obscure problem causing incomplete scan in unlikely circumstances
8.1.13
Released September 2018
Fixes crash during export of links when using the full Mojave release
Some corrections to the list of options available under File > Export
change of behaviour - when app first launches, and when a new scan is started, the 'bad links only' button is reset. (if it's left pressed from previous work, it can cause nothing to display in the Links views, causing confusion.)
8.1.12
Released September 2018
Fixes bug that may have caused crash with certain urls.
8.1.11
Released August 2018
Further work around the improvement to the meta http-equiv refresh handling .
8.1.10
Released August 2018
(Pro and Plus) 8.1.9 was incorrectly sandboxed, possibly resulting in website configurations not being visible for users upgrading to 8.1.9 from an earlier version and then to 8.1.10 Users should contact support for the solution in this case.
8.1.9
Released August 2018
Fixes 'next bad link' button in link inspector
Fixes a bug which would have caused Integrity to stall at the first url (reporting that as a 200 but going no further) under an unlikely set of circumstances
8.1.8
Released August 2018
10.14 Mojave dark-mode-ready
Different handling of a common issue: linkedIn urls returning a 999 code (even though the link may work in a browser). This is not an Integrity issue but common to all webcrawlers / testers. LI seems to detect the rapid requests and/or non-browser querystring and returns a non-standard 999 code. Integrity used to present this as a server error and count it as a bad link. Now it labels it as a warning, and does not count it as a bad link. This is because it is not necessarily a bad link, it just hasn't been possible to test it properly.
Fixes issue with meta http-refresh not being observed if the page contains content with links. (The content was being parsed for links, in favour of the redirection being observed.)
(Pro) (Build 8.1.81) Fixes bug causing no data to show when 'duplicate descriptions' is selected in SEO Filter button
8.1.7
(officially beta when released, because the 10.14 APIs are still officially beta)
Fixes bug which may have been responsible for some unexpected results for some users
8.1.6
(officially beta when released, because the 10.14 APIs are still officially beta)
Enables dark mode when using MacOs 10.14 Mojave (will respect the user's choice of dark or light mode in System Preferences)
(Pro) Enables keyword density functionality in SEO table (keyword stuffed pages)
Better handling of a recurring 'Refresh' header field which could have appeared to leave the scan hanging when almost 100% finished
Some improvements to the sorting and filtering which should prevent a short hang when using the 'bad links only' checkbox in the links results. There may still be a bit of a delay with some large sites and when the 'by status' tab is selected.
Fixes Problem with 'Images' option in filter button which was showing some urls which weren't images
Fixes problem with headings / outline in page inspector (accessed from 'by page' view and double-clicking on a page rather than a link)
Other small fixes
8.1.4
Fixes problem scanning a site locally and directory path contains a space or certain other characters.
(Pro and Plus)Adds override for the built-in behaviour which excludes pages from the sitemap if they are marked robots noindex or have a canonical pointing to another page. These options are in Preferences > Sitemap, they should be on by default and should only be switched off in rare cases where it really is necessary, such as using the sitemap for a purpose other than submission to search engines (where you do want all internal pages in the file)
Updates links within the app and dmg (support, EULA etc) to new https equivalents
8.1.3
Fix to Links/By Link table which was not remembering its column information
(b 8.1.22) ditto for By Status view
8.1.2
(Pro and Plus)Adds option for 'new settings based on selection' (in action menu below list of website configurations).
8.1.1
(Pro and Plus)Fixes a problem with pages being excluded from the sitemap (reason given = canonical points elsewhere), under certain circumstances and with the 'ignore trailing slash' button unchecked (which is checked by default, should only be unchecked if really necessary).
8.1.0
Adds support for <embed> tag
Adds detection of audio and video mime types. The filter button in Integrity Plus and Pro allows you to see audio urls / video urls.
(Pro and Plus) Adds the options to include video in the xml sitemap
Fixes case where a set of circumstances could cause the scan to appear to finish early (and error shown for first url) while scan actually continues.
8.0.12
(Integrity Pro) Adds some options for spell-checking: to ignore contents of
Note that the option to check spelling within nav, header and footer is off by default
8.0.11
(Integrity Pro) Adds columns to SEO > Meta data table for
(Integrity Plus and Pro) Fixes a problem which caused pages that are marked meta robots = noindex to incorrectly appear in the sitemap
(Integrity Plus and Pro) re-inserts the 'export csv' button into the default toolbar settings
8.0.10
(Integrity Pro) Adds option to export SEO summary headlines as csv. (Helps create custom reports using Google Data Studio or other reporting tool )
Fixes Preferences > Links > Do not report redirects
8.0.9
Further measures to reduce 'false positives' (which is a key v8 feature). In this case, 403 (forbidden), may be returned if useragent string is Googlebot or not a browser. Where a 403 is received, and the user has useragent string set to Googlebot or Scrutiny, then the url is retried once, with cookies, GET method and useragent string set to that of a regular browser
Doubles the alt text buffer, alt texts of more than 1,000 characters were regularly being seen.
Fixes Preferences > Links > Do not report redirects which has not been working properly in v8.
8.0.8 (codename 'Roland')
When user marks a link as fixed, the redirect information for that link is now correctly cleared
Now correctly handles a link where href = './'
Allows for longer srcsets (>1000 characters). Previously, truncated urls may have been reported due to a buffer limit
(Integrity Plus and Pro) Fixes sorting in Spelling / by page table
(Integrity Plus and Pro) Adds context menu to sitemap table (copy url / visit url)
(Integrity Plus and Pro) Fixes problem with context menu in SEO / meta data table, 'copy url' or 'visit url' could work on wrong url
(Integrity Pro) Adds context menu to spelling / by word table (copy url / visit url)
(Integrity Pro) Adds option to spelling / by word table to 'remove without learning'
8.0.85:
(Pro)Fixes spelling dialog not opening when checking spelling of a file:// website
Updates list of user-agent strings in the preferences drop-down list, gives Integrity a more compliant one.
(Plus and Pro)corrects possible problem when exporting, claiming not licensed
8.0.86:
(Plus and Pro)Helps those moving from Integrity Plus to Integrity Pro to migrate their website settings. Or those moving to another computer to copy the settings. Adds 'export>selected settings / all settings'. In addition File > Open will import such exported settings.
8.0.87:
Fixes recent issue with code signing, for a short time builds will not have run without lowering of security settings
8.0.7
Adds column 'og:locality' to SEO / meta data table
Fixes bug causing spurious results to appear in the links tables sometimes when using the search box
(Integrity Pro) enables toolbar 'get info' button for Spelling view
(Integrity Pro, not MAS) implements update check
8.0.6
'Don't follow nofollow links' could prevent crawl from getting off the ground.
(Integrity Plus and Pro) Fixes problem in the sorting of Sitemap by 'priority' if any rules are in play
(Integrity Plus and Pro) Fixes bug preventing sitems 'priority' column from being manually edited if the sitemap rules table is empty, and bug preventing the 'change frequency' column from being edited manually
(Integrity Pro) enables 'double click to preview' in SEO / Images table
8.0.5
Some fixes relating to linked pdf documents (Integrity Pro only)
Revised default values for optimal lengths for meta title and meta description (for SEO warnings). (see the defaults in Preferences > SEO and alter them if you like)
8.0.4 (becomes full release, no longer beta)
Fixes problem where unlikely set of circs could cause crash (certain unintended spurious character included in the link target url, a specific page encoding)
Fixes bug that prevented full scanning if port number used in the starting url
8.0.3 (beta)
Adds support for html5 tags. Finds and checks urls for audio / video / pictures
Sorts a problem with redirects, where a url is redirected to a url already in the list. Sometimes this could result in an odd status being reported, (302 < 302 rather than the correct 200 < 302)
Improvement to IDN functionality, specifically if page contains percent-encoding within domain part of url, wasn't being handled properly.
The automatic retry with cookies enabled for urls that give 'too many http redirects' (where they continuously loop while cookies are not detected) is extended to all urls (previously just external urls)
Fixes bug that could cause a hang at the end of the scan if the archive option is set, 'browsable format' set and certain things present on any of the pages.
small changes:
Check Images is now on by default
Many other small fixes
8.0.2 (beta)
Important fix re linked files in head of page, could be displayed with spurious link text
Some fixes to file size / target size functionality
8.0.1
Changes to the way that images / image size are counted
Efficiency at end of scan
Sorts out some issues with Export toolbar buttons and menu items
release of Integrity Plus v8 as beta
8.0 (beta)
Version 8 rewrites some of the underlying data structures and collects more information about your links. The biggest changes won't be visible, but the app should be more efficient, and cut out some potentially long background tasks after the scan and after re-checking links.
Better control over volume of requests ("Limit requests to X per minute" replaces the delay field)
More intelligent automatic re-trying and feedback to the user
6.12.0
Restores ability to scan website locally
6.12.0
Adds ability to attempt scan Wix site. No option for user, Wix site is autodetected using the generator meta tag
We don't endorse or encourage the use of Wix, their dependency on ajax breaks accessibility standards and makes them difficult for machines to crawl (ie SEO tools and search engine bots) and impossible for humans to view without the necessary technologies available and enabled in the browser.
Fixes bug in 'highlighting', if the link occurred more than once on the page, only the first would be highlighted properly.
Fixes minor bug in column selector above certain tables, for French users.
(Integrity Plus) Fixes bug preventing pages from being correctly excluded from sitemap where robots noindex is set in the page head
(Integrity Plus) Fixes bug causing potential crash if pages are excluded from sitemap for both possible reasons and user presses the button to see the 'more info' button
6.11.16
Fixes bug in 'highlighting', if the link occurred more than once on the page, only the first would be highlighted properly.
Fixes minor bug in column selector above certain tables, for French users.
(6.11.15 was a release to the app store only, the issue didn't affect web version)
6.11.14
(Integrity Plus) Important fix - after scan finishes, depending on certain sequence of events, Sitemap table may have appeared blank. Data should now correctly appear
Some improvements to 'rules' dialog:
Rules dialog opens as a sheet attached to the main window, rather than randomly positioned on the screen
Adds 'urls that contain...' and 'urls that don't contain....' option giving much more flexibility
(removes 'only follow'. The wording of this became confusing in certain cases (eg if you have more than one of those rules) and it's no longer required because it's the same as 'do not follow urls that don't contain' )
(Integrity Plus) Some improvements to the site management (clicking from one website configuration to another). With certain sequences of actions, unexpected results could be seen.
Other small improvements
(NB Mac App Store Integrity Plus has version 6.11.15 but this contains a fix specific to the app store version).
6.11.13
Important update for French users - when using French localisation, when making a blacklist rule ('Ignore links containing...' etc) the new rule appears not to save when OK pressed.
6.11.12
Re-release of Integrity plus to Mac App Store. Some changes relating to sandboxing and licensing but no changes to functionality
6.11.11
(Integrity Plus only)
Fixes possible crash if many urls are selected and 're-check selected' performed
Fixes problem with Preferences > Sitemap > Check for robots.txt (urls disallowed by robots.txt were still appearing in the sitemap)
6.11.10
(Integrity Plus only)
Fixes possible failure to launch if a much earlier version of Integrity had been used before starting a recent version of Integrity Plus
v6.11.9 released November 2017
Fixes problem with finding all frame urls within a frameset
Adds a trim to the starting url before starting in case whitespace / return characters have been included via a copy and paste.
Fixes problem with ftp of sitemap file, if Preferences has been set to 'ftp only' and the sitemap generation is attempted before certain other actions.(Integrity Plus only)
v6.11.8 released October 2017 (Integrity Plus only)
Fixes odd problems with the search field
Improves the user experience when the [+] (new site) button is pressed for the first time. If a website has already been configured without first creating a new config, the url, settings, rules etc will be saved as the first site before the new one is created (becoming the second site)
v6.11.7 released October 2017
Some fixes and improvements to the 'file size' functionality. And adds option to 'load all images' With this option on, all images are loaded and the size noted. So the 'target size' column of the 'by link' and 'flat views' will show the actual size of the image. With the option off, a size may still be displayed in those columns, but it then relies on the Content-Length field of the server response header, which may be the compressed size of the image or not present. The option slows the scan and uses more data transfer, so only use if you're interested in the size of images on your pages.
Fixes odd results if a link is an anchor link and contains unicode characters within the anchor
v6.11.5 released September 2017
Fixes links incorrectly found within javascriopt
Fixes problem causing bad link count to be a little higher than the actual number of bad links. (Caused by certain external urls responding with error butreturning OK when automatically retried, the bad link had already been counted and wasn't reset)
v6.11.4 released September 2017
important release for users of High Sierra
Fixes problem that could cause incorrect link text to be reported
Where appropriate, Integrity uses the HEAD method for efficiency. However, some servers incorrectly return a 404 or 5xx in response to a HEAD request. Such urls are now automatically retried using GET.
v6.11.3 released August 2017
Adds case sensitivity when checking file:// urls there's a new option on the 'Global' tab of Preferences, case sensitivity is on by default.
Fixes bug which prevented some srcset (2x etc) images from being found
Increases stability and efficiency under certain circumstances
Fixes minor problem with the 'delay' functionality (for throttling requests). The bug caused this setting to sometimes not be observed.
Fixes incorrect handling of base href = single forward slash, now correctly interprets as "relative to the public root"
v6.11.2 released August 2017
Fixes bug causing scan to stall if crawling locally and site is on an external volume
Enables 'Find' (cmd-F) within debug console
Keyboard shortcut for main views are changed - shift added (cmd-shift L, T, S, F)
v6.11.1 released August 2017
Fixes bug causing incorrect redirect if a port number is part of the url and the same url redirects multiple times
Adds debug console with verbosity control
v6.11.0 released August 2017
Adds French localisation to entire app
v6.10.2 released August 2017
Fixes bug causing html pages to not be added to SEO results or Sitemap if it contained no links
Adds options to ftp dialog (sitemap export) to use TLS, and adds field for port number (defaults to the usual 21)
Some other small improvements such as validation of the directory field
v6.10.1 released July 2017
fixes issue with links not being found after self-closing script tag in body (<script .. />)
fixes issue with <img src-data= causing a garbage link to be reported
Adds ability to begin crawl at an xml sitemap where the sitemap is a sitemap index file which links together a number of xml sitemap files
More improvements to the new advanced options which will be helpful in a small number of cases
Where a timeout is encountered, Integrity will now invisibly retry once, in case it's a spurious or short-lived problem
fixes bug with sitemap generation, if sitemap was large enough to need splitting into multiple files, one was being missed from the sitemap index file
v6.10.0 released July 2017
Improvements to engine, may help with certain sites where timeouts are experienced, maybe randomly or maybe abruptly bringing the scan to a stop. A new 'advanced' preference added to override the new changes and force all connections to stay alive to completion of data load. Not to be used generally (especially where the site contains links to large files) but may help in some situations.
Adds fix for problems experienced with certain unicode characters since 6.9.0 when IDN support was added.
Fixes issue where some relative urls were incorrectly being punycode-encoded under certain circumstances.
v6.9.2
Fixes a bug with the 'do not report redirects, just the final status' Preference. (The 3xx statuses were correctly not being reported, but the actual redirect url wasn't being logged, this was leading to some small and obscure problems.
Bug fix, prevented proper running on 10.8 (claimed min system requirement OSX 10.8 restored)
v6.9.1 released June 2017
Fixes parsing problem that could cause spurious links (incorrectly found within javascript)
Small change that helps stagger multiple simultaneous requests
Adds French localization to context help
v6.9 released June 2017
Adds support for IDNs - start with either the unicode or encoded version, the unicode version will be displayed, the http requests will be correctly handled using IDNA encoding
NB - Integrity has long been able to handle unusual characters in the path / filename of a url using encodings such as percent -encoding. This refers to unicode characters in the domain part of the url.
v6.8.25 released May 2017
Fixes possible crash on completion of scan under certain circumstances
Allows generation of a sorted list of images by file size, and which pages they appear on (adds 'target size' column (optional) to the Links 'by link' and 'flat' views)
v6.8.24 released April 2017
Adds 'copy urls' to the context menu where multiple items are selected in all link results tables. (cmd-C also enabled where multiple items are selected). a return-separated list of the selected urls is copied to the clipboard.
Fixes a crash when using multiple select and 're-check'
v6.8.23
A number of fixes around the sitemap functionality, exclusion of pages from the sitemap and canonical urls:
Adds a button for viewing pages which have deliberately been excluded from the sitemap. It opens a table showing the url, canonical url and the reason that the page has been excluded. The table has context menu for copy url and visit
Where a page has a canonical url pointing to itself, this page may have been incorrectly excluded from the sitemap in the past if the canonical url's capitalisation is different from the page url. This match is now checked in a case-insensitive way.
Other small fixes
Fixes obscure problem, canonical and other links in the head truncated if url contains /head
v6.8.22
(Integrity Plus) Adds menu items to Filter button - Redirected and PDF documents (to easily see only those things)
Fixes crash or hang if starting url is a file and that file can't be found, and dock icon is showing progress bar.
Fixes problem with wrong starting url sometimes being used after File-Open dialog.
Inherits a fix to the engine, not always recognising an end comment where it looks like -------------->
v6.8.20
Fixes a problem causing Integrity Plus to quit on startup after a certain sequence of events including starting the free Integrity beforehand
Fixes logical error which meant that if user viewed the 'by status' or 'flat view' while the scan was running, these would not be updated properly at the end of the scan
Adds Googlebot's user-agent string to the drop-down list of UA strings in Preferences
v6.8.19
Some improvements to the engine, including low disk space detection - offers to stop or continue before space (on the system disk '/' ) becomes critical
Some fixes to the 'mark as fixed' function - fixes keyboard shortcut (enabled in by link view only), enables multiple selection & mark as fixed in the by link view, and fixes the 'follow up' for that (removing from view if 'bad links only' is in operation)
Some users of Integrity Plus who downloaded between 1 and 4 Jan 2017 and attempt to buy in-app may see an error. Please download a new version and try again.
v6.8.18
Further improvements to the help system.
Unfortunately, OSX components that enhance the help system are available in 10.8 and above. Therefore this version requires minimum 10.8. Users of 10.6 and 10.7 should use Integrity version 6.8.15 and Integrity Plus version 6.8.17.
PeacockMedia's end user licence agreement version v1.2 (published 25 Nov 2016) applies.
v6.8.17
Much improved context help system. Discreet 'i' buttons beside many settings pop up some useful advice about the setting, with a button for the support form.
Fixes hang or unpredictable behaviour if a new configuration is made and the starting url entered is a local file.
Enables 'File > Open' for opening local files (html files to be scanned locally, or xml sitemap or csv file containing links)
v6.8.15
Fixes bug causing links to have blank url if the found url contained a particularly unusual percent-encoded character or one that doesn't convert in the claimed encoding.
Now ignores link targets in double curly braces, ie href = "{{ something}}" - used as placeholder in certain content management systems (ie Angular, Expression Engine). Previously Integrity was incorrectly constructing an absolute url and testing it. Note that such links can be rendered properly and tested using Scrutiny's 'render javascript' feature.
v6.8.14
In case where a page uses the Refresh server response field, and has a large time delay, this could cause Scrutiny to hang at the end of the scan.
v6.8.12
Adds multiple selection to by link, by status and by page tables, (these can of course be sorted and filtered in Integrity Plus) and the context menu item 'Re-check selected'. This is a replacement for the old 'Recheck bad links' menu item which was flawed in many ways.
Fixes obscure problem where /head appears within the canonical url, this mistaken as the /head tag, leading to some spurious code appearing in the link results.
v6.8.11
Important fix for anyone scanning locally. Fixes bug present since 6.8.6 which could cause scanning of local files to stall.
v6.8.10
Fixes a crash experienced if using the 'recheck this link' button in the link inspector, and then 'next bad link' before the recheck has finished
v6.8.9
When XML Sitemap is generated, if the file is larger than 10MB or 50,000 URLs then it will be broken into multiple parts. You only need to specify your filename once, the first file is a sitemap index file using the filename that you specify (eg sitemap.xml) and additional sitemap files are numbered (eg sitemap-1.xml, sitemap-2.xml etc)
Note that the links within the sitemap index file (if generated) will be *relative urls* (eg "sitemap-1.xml", "sitemap-2.xml" etc) We're so far unable to establish whether this is acceptable. If not, it may be necessary to edit those urls to include the full web path.
Percent-escapes spaces in urls when generating xml sitemap
v6.8.6
Adds support for the server header field "refresh". (Not official web standards but has been supported by most browsers for a very long time.)
Improvement to completion of referer field in http requests where a redirect is concerned
v6.8.4
Important fix - fixes some spurious non-existent links found when hreflang is present within <link > or <a > tags
v6.8.2
Adds much easier way to select columns for certain tables (flat view and by link) - a menu pulled down from a button just above the table. Similar menu available in export dialog too
Fixes problem with 'exporting disabled' message appearing even after licence is activated
'Distance', now renamed 'Depth' where it appears in tables
Fixes possible mistaken links 'found' within javascript
Now makes sure quotes are trimmed from meta refresh url
Some ../ weren't being correctly resolved if they appeared within the middle of a relative link - improved now
Adds preference to be tolerant (ie not report a problem) in cases where a ../ travels above the root domain. Although technically an error, browsers tend to tolerate this (assuming the root direcory) so such links will appear to work in a browser
Small fix to meta refresh redirects
v6.7
Adds pattern matching in blacklists / whitelists. * and $ can be used
Link inspector now remembers the size the user has dragged the previous one to
links limit in Preferences is capped. Previously, entering a stupidly higher number could cause problems.
Fixes bug causing some spurious data to be included in the link check results, when 'check linked js and css files' is switched on
Reduces some initial memory allocation - more memory efficient when scanning smaller sites.
v6.6.5
Fixes unlikely bug causing (under very unlikely circumstances) links to be listed with doubled-up characters, eg hhttttpp:://// etc
Fixes bug causing absolute urls to be constructed incorrectly (under certain unlikely circumstances - if a page is redirected more than once, the first redirect url has a trailing slash but the second doesn't.)
v6.6.4
Fixes bug with detecting and handling http meta refresh
Adds 30s timeout to ftp of sitemap xml (Integrity Plus)
Adds 'only include each image once' option to xml sitemap (Integrity Plus)
v6.6.3
handles a problem where unexpected things are present in the base href. In unlikely cases this could lead to a crash, now handled gracefully.
Improvements where <script> tags are concerned, cures problem of spurious links incorrectly being reported where a link was found within <script> tags
v6.6.2
Small fix for an issue causing some instability where I or I+ is running on Yosemite and a page has a link to docs.google.com or drive.google.com
v6.6.1
Fixes certain items failing to expand in 'by page' table
Fix to highlighting feature with html5 pages
v6.6
Adds option to check linked files (linked external stylesheets and javascript files etc) while scanning
Fixes a number of problems related to filtering / sorting of your list of sites, including confusion of two configs with the same starting url
v6.5
Adds preference to consider links to the http version of the site to be 'external' (ie don't follow & don't include in sitemap)
New preference is switched on by default
Note that the second preference to alert the user / show details of links to the http version of a site is a Scrutiny feature. The information can be found in Integrity's results, just the alerting / presentation of the information that isn't included in Integrity / I+.
v6.3.4
Fixes html export of sitemap (Integrity +)
v6.3.3
Fixes auto update check (is in I+, currently not Integrity)
v6.3.2
Fixes problem with colour labels in 'by link' view when rows are expanded
Fixes empty cells in html export of flat view
v6.3.1
Fixes problem since v6 with finding link urls within image maps
Improves 'archive' functionality: (Integrity Plus only)
adds option: don't show save dialog each time (remember and use the same location each time)
also adds option to process the archived pages rather than just dumping the html as before - process inks / images etc within archive, and recreate directory structure so that pages display properly in a browser and can be browsed. (a la sitesucker)
These options are available from an 'options' button beside the 'Archive pages while crawling' setting
Adds 'ignore' to the options for blacklist rules, eg "ignore urls containing..."
Fixes bug causing blacklist rules to be 'remembered' when scanning, after they've been deleted from the table
small improvements to engine for html5 pages
small efficiency / speed improvement
small increase in price (Integrity Plus only)
v6.1.2 (changes relating to Integrity Plus only)
Fixes problem of blacklist terms persisting after a new site is created
Exporting disabled when app is running in trial mode
Fixes bug causing export > sitemap as... to beep and take no action if the export is requested via the menu item and the sitemap tag has not already been selected at least once
v6.1.1
Fixes problem with 'don't follow nofollow links' setting, was causing incomplete crawl
Fixes problem with 'sitemap options' preference and not performing ftp
Removes preference 'check for robots meta tag', because it should always be checked and observed (checking for the tag has no real impact on the crawl time). Makes no sense to give the user the option of including a page in the xml sitemap which is marked "robots noindex"
v6.1 (1)
improvements inherited from v6 engine:
fixes bug causing commented out titles etc in head not to be ignored
fixes a problem preventing a site from being crawled properly if the starting url redirects to a different domain / subdomain.
fixes some instability in 6.0.x
fixes a problem causing missed links in html5 pages, specifically those that use tags
fixes a problem causing links in a text list to not be followed
If starting url is ambiguous (could be a page or directory) Adds a dialog to check this with the user as this affects the scope of the scan (remains within the starting directory)
v6.0.13 (1)
Fixes 'tolerance (trim whitespace or mismatched quotes)' button
Fixes problem with default filename for local crawling
Better handles extremely large html files (multi-Mb)
Fixes problem in 6.0.12 relating to coming out of beta, a limit was unintentionally left in place
v6.0.12 Released Nov 2015
First non-beta release of v6, Integrity and Integrity Plus have the 'v6 engine' which is faster and more efficient.
v5.4.5 Released Sept 2015
fixes bug causing old pages with certain specific characteristics to be 'status checked' but not parsed for links resulting in not all pages being scanned. (Unlikely to have affected many sites)
(Integrity Plus) fixes bug causing urls with bad status (4xx or 5xx) to appear in xml sitemap if labels are switched off in Prefs
v5.4.4 Released Sept 2015
Improvements around xml sitemap (Integrity plus)
Adds info about pages excluded from sitemap for any reason
Properly excludes urls from sitemap where canonical url points to a different url
Fixes problem with plus disappearing from application icon while progress bar is showing (Integrity plus)
v5.4.2 Released Sept 2015
Implements sorting in By page view
Improves support for 'meta http refresh' type redirects
Improves soft 404 check - better at finding the target text in external pages and pages that are redirected by meta refresh
Includes fix for crash experienced where the site includes link(s) to docs.google or drive.google and Integrity is being run on OSX Yosemite. Crash will have started happening since the beginning of September 2015 and will have happened at a consistent point in the scan.
v5.4 Released July 2015
Adds 'By status' view
Allows sorting by any column and has a context menu to copy the link url, redir url or 'appears on' url
has an option button allowing you to group redirects by initial status, final status or the combination
v5.3.1 Released July 2015
Adds dialog with a preview when user is exporting links to csv or html. Default is the flat view but all options are available and made clear in the dialog
v5.3 Released June 2015
Some fixes and improvements to xml sitemap:
Allows editing of change frequency within the results table. Changes made are 'remembered' for future scans of the same site
Adds 'match' column to sitemap rules table (partial match or match whole string)
Other fixes / enhancements
better handling of base href. Now handles relative base hrefs and 'relative to root' ("/") properly
v5.2 Released June 2015
Adds support for redirects by meta http refresh.
v5.1.9 Released June 2015
Fixes problem checking links within text/csv files and xml sitemaps (since 5.1.5)
Fixes problem with text encoding affecting parsing xml files
v5.1.8 Released June 2015
Some changes designed to help crawl very, very large websites:
The 'don't check external links' option now prevents Integrity from listing external links at all, thus reducing the data stored
A new option in Preferences to limit occurrences. For each occurrence of each link, a number of strings are stored (url of the page it appears on, link text and more) so using this option with a small number (minimum of one occurrence) will again reduce the amount of data stored
Other enhancements
Adds dialog with a preview when user is exporting links to csv or html. Default is the flat view but all options are made clear in the dialog
Auto update improved, gives more feedback
Fixes
Fixes bug with 're-check this link' - after using this option, only the final status code was shown, no redirect
Version History
v5.1.7 Released June 2015
fixes issue with image srcset where only one url appears
v5.1.6 Released June 2015
adds a progress indicator to the count field (x of y links checked, z bad)
Fixes bug causing unexpected results if hreflang appears in links ahead of the href
v5.1.5 Released June 2015
Enhancements:
adds support for img srcset - all image urls are found checked and reported (assuming 'check images' is turned on)
Soft 404 function looks for terms in text content, not in whole page source
Efficiency improvements
efficiency - large files of unknown mime type are assumed not to be html and not downloaded
Fixes:
img alt text is now found if it appears in the tag before the src or srcset
couple small bugs fixed related to 'soft 404' check. If switched on with images switched on too, large amounts of messages could be written to the Console. If switched on, could cause hang at completion of crawl (which is overcome by pressing 'pause' and then 'continue')
v5.1.1 Released April 2015
Fixes bug which was causing crawl to loop ad infinitum if #! appeared in the url (though websites using this to generate dynamic content seem to rely on js. If there is no alternative starting url which gives an html page (eg for search engine submission) then Scrutiny has the necessary 'render page/run js' feature)
v5.1.0 Released January 2015
Handles images where src = "data:...."
Fixes bug which may have been responsible for a crash experienced by a very small number of users (crash characterised by 'stack overflow' in the crash report)
Internal changes to enable consistency with Integrity Plus
Version numbering system of Integrity and Integrity Plus will be consistent
v5.0.12 Released Dec 2014
Corrects problem with 'Wordpress or other SEO-friendly urls' feature
Fixes problem with dock icon reverting to the old one while running
v5.0.11 Released Dec 2014
Adds checkbox to Settings screen 'Wordpress or other SEO-friendly urls'. This needs to be checked when a url is in the form mysite.com/publications/all-publications/ where all-publications is a page not a directory. Without the checkbox checked, Scrutiny would regard /all-publications as a directory and limit its crawl to urls within and below that 'directory'
Application icon reluctantly 'Yosemitized'
v5.0.10 Released Dec 2014
Alters csv exports slightly, row separators are now LF character (Unix-style) rather than CR, for easier parsing
v5.0.9 Released Dec 2014
Fixes bug that could cause crash when scan completes
v5.0.8 Released Nov 2014
Fixes instability experienced by some users
fixes bug related to support for ISO-8859-1 encoding and adds ISO-8859-2
v5.0.7 Released Sept 2014
A number of enhancements relating to character encoding:
More character encodings added to the list of supported encodings. Adds Thai encodings (windows 874 and TIS-620), Japanese (Shift_JIS) and some Simplified Chinese (windows simplified chinese, HZ_GB_2312 and GB_2312-80)
Reads the 'charset' attribute of every page (previously a detection was performed on the first page and the encoding used for the whole site)
Other enhancements and fixes:
Adds selection button beside the User Agent String field, populated with a few common browsers
v5.0.6 Released August 2014
Although not recommended in urls, support added for entities in the html (eg ') all known named and numbered entities handled, ascii and utf (some named entities such as ' were being handled previously)
Fixes bug that could cause external pages with querystrings to be duplicated in the links list when 'ignore querystrings' is checked
v5.0.5 Released August 2014
Supports feed:// urls
Adds transparency / opacity to colour picker for highlighting (Preferences>Views), allowing users not to see highlighting for redirects but for 4xx and 5xx errors, for example.
v5.0.4 Released August 2014
Fixes problem with showing 'bad links only' in flat view
v5.0.3 Released June 2014
When crawling html files locally, now adds filename if necessary. Default is 'index.html' but this can be changed in Preferences > Global
Fixes bug that could cause Integrity to go into a loop when crawling a site locally & continue until reaching the preset maximum number of links
Improvement to print button / menu item - detects and prints the frontmost table
v5.0.2 Released June 2014
Fixes 'missing link text' bug in 5.0/5.0.1
truncates urls in html export, avoiding silly column widths
Uses shorter format for date stamp, easier to read, reduces column widths and file sizes of exports
v5.0 Released May 2014
New features:
Now supports urls which include non-ascii characters (although not in the domain, IDN's still unsupported). Some may argue that this is against web standards, but it's becoming more common and accepted by Google and browsers
Auto-detects character encoding of pages, character encodings now supported include CP1251 (Cyrillic script eg Russian, Bulgarian, Serbian Cyrillic)
Adds progress bar to dock icon
Opens and scans a list of links in html, plain text format or xml sitemap (automatically detected)
Blacklisting / whitelisting is no longer applied to starting url. Previously, starting url had to pass black/whitelist test otherwise crawl wouldn't get past the first page
Better handles urls with port numbers (problems experienced with some servers re urls with a port number when returning a redirect)
Fixes and improvements:
[edit: There used to be a note here about scanning Wix sites, but this is now out of date. For the latest on Wix support, please contact Support.]
Better handles entities involving a hash (eg ') within a url. Previously was truncating the url at the hash assuming it to be a fragment/anchor
Fixes bug causing spurious text to be reported as the link text if an image has alt = ""
Adds character encoding tag to head of HTML exports
Correctly removes all temporary files when application quits. v4 and before had removed temporary files only when starting a new scan. previous points of v5 had not removed all files.
Improves csv and html export, these now reflect the sorting / filtering of the table being exported
Correctly handles links using ./ (same directory)
v4.5.3 Released January 2014
Fixes and improvements:
Handles 'callto:' links (already handled skype: and tel:) No longer reports them as bad links
Adds 'Visit' button to link Inspector window
New application icon with less transparency (previous one difficult to click)
Fixes problems with 'Last checked' information
Sandboxed and code-signed for your security
v4.5.2 Released December 2013
Fixes and improvements:
Recognises an xml sitemap file, File>Open and Integrity will test all of the links within it. (Has previously been able to test the links within a text file in plain text or html format)
Fixes problem which could lead to incorrect information in the 'occurrences' of a link where another url redirects to that url
The above fix will lead to slight differences in the results for some sites (a small increase in data). The new version should be more accurate
Integrity running on previous versions of OSX isn't tolerant to links which try to access a folder above the domain (eg foo.com/../somepage.html) due to changes in Mavericks, such links are reported as fine. from 4.5.2, Integrity traps such links and reports them as badly formed. Note that some developers consider such links fine because they are generally tolerated by browsers (they ignore the parent directory instruction) but they're technically incorrect and there are no plans for Integrity to have an option to tolerate them
Turns off some unnecessary console information and clears up some console warnings
Fixes a bug which was causing some instability under certain circumstances and an occasional crash when clearing the results of one site and starting the crawl of another
v4.5 Released October 2013
Better 'soft 404' support - a list of terms which are searched for in the url and content of internal and external sites.
You can customise this list to find soft 404s within your own site or add terms found in external soft 404s
You can switch the feature off (in Preferences) if you have a large site and want best performance and this isn't important to you
v4.4.1 Released September 2013
Retina screen compatible
OSX Mavericks tested and supported
Improvements to interface:
Main window's Toolbar redesigned in line with Apple's human interface guidelines and for retina screen compatibility
Adds toolbar controls (show / hide / customise) to main View menu
Small bug fixes and enhancements:
Fixes two small and unrelated bugs causing odd results if nofollow switched off and base href present but set to ""
Expandable views will only expand when crawl is paused or finished. This improves speed and efficiency, prevents crashes related to memory on older systems
Now indents data for expandable views when exported as csv, html
Minimum system requirements OSX 10.5. 10.4 users should not upgrade to v4.4, for compatibility with newer systems it uses features not available in 10.4
v4.2.2 Released June 13
Fixes problem with new link text column in By page view not always displaying accurate data where same link occurs multiple times on same page
Tweaks how information is displayed in new expandable By link view
v4.2.1 Released June 13
Fixes crash on launch for some existing users
v4.2 Released June 13
improvements to interface:
- Changes the 'by link' view to an expandable view, occurrences can be seen by expanding view rather than as previously having to open the link inspector
- Link inspector still appears on double-click from link views and is improved
- Adds context menus to the 'by link' and 'by page' views and the 'appears on' table in the link inspector - a number of actions can be performed with a right-click (or control-click) including 'Copy URL' and where appropriate Visit, Highlight and Locate
- The new Copy URL action is available with a command-C and will copy the URL of the selected item
- A new Locate action lists how to click through from the starting url to find the link in question. It is available via context menus, the link inspector and cmd-shift-L
- Adds 'link text' column to 'by page' view
- Change to wording: 'on page' now 'appears on'
- Changes default for highlighting a link on the page - now looks like highlighter pen rather than a box around it (changes prefs defaults to 'background' rather than 'border', and changes the default colour to yellow rather than dark grey - ie (existing users can select this option in prefs if they like))
Ignores and continues if 'bad SSL certificate' warning is encountered. But only for the website being tested. (anything else, ie external links, won't be followed anyway)
If image checking is switched on, now collects alt text and displays in 'link text' columns
Some options removed from Preferences>Views>By Link view (Status, URL, On Page) because these are needed for the new outline view to work properly
Exporting from 'by link' view better than previously. (was putting all 'on page' information in a single cell to reflect the view - led to problems due to Excel's 256-character limit)
Export added to by Page view
Exports from expandable views reflect the state of the view, ie which rows are expanded or not
v4.1.1 Released April 13
Adds 'bad links only' checkbox (was possible to toggle bad links / all links using menu or toolbar button. This new checkbox makes the option more obvious if toolbar not showing and for similarity to Scrutiny)
Adds filter drop-down list (All, Internal, External, Images) and search box above all views
If flagging blacklisted urls, then the highlight colour used is orange or the warning colour (was red or bad link colour). Not an error so inappropriate to use an error colour.
v4.0.4 Released February 13
Fixes problems creating black/whitelist rules on first run with no settings saved
Correctly sets window to edited (dirty spot in red button) when black/whitelist rules are changed, triggering prompt to save when switching settings
Version 4.0.3
Small fixes, full release February 2013
Version 4
released January 2013
Major improvements to the data storage and engine meaning that even small sites will crawl more quickly and large sites will crawl very much more quickly without slowing down or losing responsiveness
When stop button is pressed, all open threads are abandoned, and then recreated if 'continue' is pressed. Gives a much better user experience
Routines for 'by page' view re-written to avoid apparent hanging at the end of the crawl of a big site
Adds new settings to Preferences, allows setting of some limits - default to 200,000 links. Offering the option of limiting the crawl of a large site (maybe better achieved by using blacklist / whitelist rules) but also a safety valve to prevent crashing due to running out of resources when crawling very large sites
If starting crawl within a directory, crawl is limited to that directory, ie crawl will go down a directory structure but not up. This matches users' expectations. Previously, crawl extended to all pages in the same domain
Blacklist and whitelist boxes replaced by a more user-friendly table of rules (existing data will be presented in the new way)
Moves 'check links on custom error pages' to settings rather than global preferences
Increases maximum number of threads from 30 to 40 (will improve crawling for some sites) with the default now 12 rather than 7. Extreme left (labelled 'fewer') is still a single thread.
Version 4.0.3
Small fixes, full release February 2013
Version 4
released as Release Candidate January 2013
Major improvements to the data storage and engine meaning that even small sites will crawl more quickly and large sites will crawl very much more quickly without slowing down or losing responsiveness
When stop button is pressed, all open threads are abandoned, and then recreated if 'continue' is pressed. Gives a much better user experience
Routines for 'by page' view re-written to avoid apparent hanging at the end of the crawl of a big site
Adds new settings to Preferences, allows setting of some limits - default to 200,000 links. Offering the option of limiting the crawl of a large site (maybe better achieved by using blacklist / whitelist rules) but also a safety valve to prevent crashing due to running out of resources when crawling very large sites
If starting crawl within a directory, crawl is limited to that directory, ie crawl will go down a directory structure but not up. This matches users' expectations. Previously, crawl extended to all pages in the same domain
Blacklist and whitelist boxes replaced by a more user-friendly table of rules (existing data will be presented in the new way)
Moves 'check links on custom error pages' to settings rather than global preferences
Increases maximum number of threads from 30 to 40 (will improve crawling for some sites) with the default now 12 rather than 7. Extreme left (labelled 'fewer') is still a single thread.
Version 3.9.1 / 3.9.2 / 3.9.3
Small fixes
Version 3.9
released September 2012
New view - 'Links by page' shows hierarchical view of your site's pages with its links below
All statuses are shown for redirected links rather than just the final one
Sorting available on all tables
efficiencies making crawl quicker and more memory-efficient, increasing the maximum size of site that can be crawled in one go
Blacklisted urls can be flagged (option added to preferences)
Adds 'Clear and Re-start' to File menu
Fixes context help 'i' button for timeout and delay fields
German localisation removed as had become out of date
Version 3.8.6
released August 2012
Fix to avoid problem experienced sometimes when pasting in a url from elsewhere
Links relative to scheme eg //domain.com (see http://www.ietf.org/rfc/rfc3986.txt section 4.2) handled better - previously problem if the page's base href was given in this format
Fixes last used settings not being saved properly
Toolbar pause button removed and role now taken by Go button
Uses alternating rows in tables
fixes redirected urls (3xx) not being highlighted yellow
Removes good colour from Preferences (to allow for stripey views)
Version 3.8.5
released June 2012
Adds support for telephone links such as tel: and skype: (now recognised and skipped rather than reported as an error)
Fixes bug relating crawling local sites introduced in 3.8.4
Fixes problem with crawling local sites if they are stored in the root Library folder
Fixes bug causing special characters such as ü, ö, ä in page title or link text being altered to u, o, a when exported. All exports (.dot, .csv, .tdl, .html) now export using utf-8 character encoding. Note that in line with web standards (RFC 1738) Integrity and Scrutiny don't support non-ascii characters in urls
Version 3.8.4
released May 2012
Fixes problem of xml sitemap not reading user's setting for update frequency
Fixes a bug which could cause hanging or crashes in certain circumstances
Fixes problem with thread counting, faster crawling
Version 3.8.3
released May 2012
Fixes spurious text appearing in 'Link text' for links on images where the images alt = '' (empty string)
Fixes bug preventing proper construction of urls where base href = "/"
Fixes bug affecting checking of broken images where image has src = "" and improved handling of empty quotes if that option is switched on
Fixes problem of crawl or 'recheck broken links' not always finishing properly
Fixes potential crash under certain circumstances (involving redirect, url having trailing slash and settings set to ignore trailing slashes)
Fixes bug affecting checking of broken images where image has src = "" and improved handling of empty quotes if that option is switched on
Default link check timeout shortened to 30s
Fixes bug preventing images from being found if 'src' doesn't follow 'img' in the html
Version 3.8.1
released April 2012
Fixes comma or trailing comma in blacklist fields preventing proper crawl
Adds preference to ignore trim leading or trailing spaces or mismatched quotes from a url
Fixes global prefs not being saved properly
When crawling locally, fixes 'file is directory' status being included in bad links
Some fixes to the 're-check bad links'. (Was causing crash sometimes since last release)
Highlighting link on page feature is switchable between highlighting and simply visiting page. Default is the latter.
Fixes problem of throbber sometimes continuing to turn when crawl or re-check has finished
Version 3.8
released March 2012
Adds 'Ignore trailing slash' button to settings, can be set per site, set to 'yes' by default
Fixes a problem preventing crawling of pages if braces { } are present in the url
When crawling local files, directories are not reported as an error (as long as the directory exists)
'Customize' added to toolbar (although this has been dropped by Apple from Lion 10.7 onwards so will only appear in 10.4 -> 10.6)
Options for sitemap update frequency 'daily', 'weekly', 'monthly' etc altered to lowercase for compliance with the sitemap standard
Two versions now maintained, one built for distribution via web (10.4 - 10.7 supported) and one certified and built for distribution via App Store (10.5 to 10.latest supported). The latter will have a .1 at the end of the version number in the About box, eg 3.7.5.1 is the App Store version. Both remain free
App Store version has Lion features such as full-screen mode
Version 3.7.5
released March 2012
Fixes bug preventing settings from being saved
Small changes for compliance with App Store
Version 3.7.4
released February 2012
Sends referrer header field for every request (other than the starting url) - this fixes a very small number of odd bugs
'Open local file' is added to the File menu. Functionality to crawl a site locally or import a list of links did exist in previous versions and was documented, but wasn't very accessible as it relied on a drag and drop into the starting url field (which still works and is to be improved in a future version)
Clears data from flat link view before starting a new crawl
Improves re-check broken links - now correctly uses as many threads as are set in settings and fixes problem preventing it from finishing every time. Also small fix to prevent it going into a loop if button pressed when there are no bad links
Adds background image and installation instructions to dmg file
Fixes bug preventing links to w3c being checked properly
Fixes a small memory leak
Fixes bug preventing crawl from finishing properly if user tries to highlight link on page before link has been checked
Fixes bug preventing date stamp from being written properly every time
Fixes problem of link text not showing in main link table for certain sites by trimming whitespace characters from around link text
Version 3.7.3
released November 2011
Links to subdomains can be considered as internal rather than external. ie peacockmedia.software and www.peacockmedia.software are considered the same site (which is not necessarily true but most people would expect) and therefore both are followed. Adds checkbox in global preferences to switch this option. Default is on. With the option on, Integrity will discover more links (and potentially more bad links) on certain websites. Option needs to be switched off if you wish to deliberately limit your crawl to one subdomain
Fixes memory problem, helping application to deal with larger sites
Bug fix and small improvement to 'my sites' drawer
Closing main window quits application after 'are you sure' dialogue
Version 3.7.2
released October 2011
Exports .dot file (standard format used by graphing applications) which can be opened as a visualisation in third-party graphing apps. includes colour to indicate levels. Accessed via File>Export or a new toolbar button added via 'Customize toolbar...'
Fixes problems with 'Re-check broken links' and 'Re-check this link'
Fixes 'on page as title / url' preference (broken in last version)
Adds 'Getting started' to the Help menu and splash screen
replaces 'Bad links' icon with a more suitable one (previous one looks like 'delete')
Fixes glitch with 'Inspect selected' button when flat view is showing
Version 3.7.1
released October 2011
Single version compatible with OSX 10.4 Tiger through to 10.7 Lion (minimum Intel / ppc 10.4)
(since v3.6, an older version, v3.5 was offered to Tiger users)
Improvements to user interface: toolbar - customisation includes space and flexible space, contents of settings tab move to fill the space as main window is resized
Fixes problem of user not being able to get main window open again if closed
Fixes bug causing base href not to be discovered which could lead to many improperly-constructed relative urls
Fixes distance column in flat view
Version 3.7
released August 2011
OSX 10.7 Lion compatible
Improves 'My Sites' - allows the same url to be saved more than once with different settings.
Version 3.6
released May 2011
Ability to import list of links, either html format or plain text list
Online manual linked from Help menu, includes instructions for crawling sites locally and importing a list of links
Moves list of sites from drop-down list to 'my sites' pop-out drawer
'last checked' date and status is stored and displayed
Last used settings are saved and visible on launch
Minimum system requirements now Intel / 10.5
Version 3.5.4
released March 2011
Fixes bug relating to empty href's.
improved reporting of link text and page titles which contain non-ascii characters.
Version 3.5.3
released January 2011
efficiency improvements (using internal cache rather than copying data, object retention / release)
Version 3.5.2
released November 2010
New option to allow 'not followed' links to be excluded from sitemap.
Fixes bug preventing Integrity from recognising a link if it has a carriage return immediately after the a.
Version 3.5.1
released October 2010
German localisation added.
Fixes bug causing crashes if internet connection fails or isn't stable.
Allows copy of url from 'on page' column of link inspector (as per filenames, requires two single-clicks to select the url - note that a double-click opens the page and attempts to highlight the link on the page using a style set in Preferences).
Fixes bug causing crawl to stop if starting url is redirected.
Version 3.4.1
released October 2010
Fixes bug causing random crashes introduced with major changes in 3.4
Version 3.4
released October 2010
Better string handling for urls and link text - makes running more efficient and correctly displays link text which includes non-ascii (non-English) characters.
Reduced background status logging also makes for faster running.
Fixes bug preventing sorting of flat view with 'bad links only' showing.
Fixes bug preventing generation of flat view if 'bad links only' showing when crawl finishes.
Other small fixes.
Version 3.3.6a
released September 2010
Fixes bug which caused instability with certain sites when using more threads.
Version 3.3.6
released September 2010
Fixes bug causing random crashes, especially when losing internet connection
Adds option to highlight missing link urls (where href = "#" or "" )
Version 3.3.5
released July 2010
Fixes bug preventing 'highlight link on page' feature working properly.
Fixes bug preventing crawling if comment terminated with more than two dashes eg '--->'
Fixes bug which prevented proper crawling if return or other characters were present inside </script> tag.
Version 3.3.4
released June 2010
Fixes bug which prevented proper crawling if return characters were present inside the <a> tag.
Version 3.3.3
released May 2010
Fixes bug which could cause crashing if using a custom user-agent string.
Context help added for some options.
Version 3.3.2
released May 2010
Minor improvements when checking sites on a local drive; improves adding 'file://' before crawling, and fixes bug preventing proper crawling.
Version 3.3.1
released April 2010
Adds setting - 'don't check external links' - makes crawl faster if you only need to generate a sitemap.
Version 3.3
released January 2010
Checks distance of each url from home page. Can be displayed as a column in Integrity's table views and exported files. See Preferences to switch this column on or off.
Generates XML sitemap. Note that the sitemap will be generated according to settings for the url crawled. (ie it is important to have settings like 'page titles are unique' or 'ignore querystrings' set correctly). Priority can be filled in automatically based on distance from home page.
Version 3.2
released November 2009
Changes to the user interface. Current url is displayed in a combo box along with the 'go' button at the top of the main window. The settings for the current url (previously called 'current config') are now displayed in the default tab of the main window. Flat and sortable views are now switched using tab buttons at the bottom of the main window.
Option for checking broken images added. Image urls are denoted by [img src] in the link text column.
Bug fix - alt text is now correctly shown (if it exists) in the link text column when the link contains an image rather than text. For example: [linked image]:NHS Direct
Some improvements to saving / deleting of settings for current site.
Auto-complete added to main url combo box. However, this only works if you type the 'http' or 'www' or however the saved url starts.
Progress indicator added for 'recheck this link'. Response time and time stamp are also correctly updated.
Help and donate links updated.
Automatic checking for updates. Checks for updates on startup. If a new version is available, informs user and invites visit to download page.
Version 3.1.2
released October 2009
Explicitly doesn't handle cookies (random behaviour previously).
will now pick up links within imagemap area tags.
Version 3.1.1
released March 2009
Fixes bug which stopped further crawling if initial page is redirected.
Small efficiency/speed improvement.
Fixes bug which could register incorrect links if a request is redirected more than once.
Version3.1
released December 2008
Time stamp logged for each link checked Views are now customisable - show or hide columns as you like. (Exported files reflect visible columns.) "Redirected" no longer shows in status column as the information is available in its own column New application icon with less transparency
Version3.02
released December 2008
Fixes bug related to unquoted href's Unique
page titles option (was new with v3.0 - crawls site faster and more
accurately if you set this option and if your page titles *are* unique)
now defaults to off for existing configs; defaulting to on was causing
confusion.
Version3.01
released December 2008
Fixes bug preventing proper crawling of framesets Fixes problem with pause/continue button Fixes problem with About panel
Version3
released December 2008
Adds 'Inspect Bad Links' to View menu (opens the first bad link in the link inspector) Adds 'Next Bad Link' button to link inspector (moves the link inspector to the next bad link if there is one) Adds two new tools to the toolbar for 'Inspect bad links' and 'Inspect selected link' and a 'Customise Toolbar...' menu item Adds
highlighting feature - double-click an 'On page' from the list in the
link inspector, Integrity will open selected page and highlight
selected link with coloured background or coloured border. Adds
drop-down lists to preferences allowing you to choose the style of the
highlighting (border / background, style and width of border) Adds
'Archive pages while crawling' checkbox to preferences (archives pages
while crawling - asks you for a save location when crawl is finished).
Version 2.2.2
released September 2008
If the link is around an image rather than text, the 'link text' columns will display [img]: and the alt text of the image. 'Redirected to' column added to flat view. Changes
to button bar including addition of export as html, csv and text (tdl)
buttons. Now properly autosaves user customisation. More information in the status display - now also shows how many bad links have been found
Version 2.2.1
released September 2008
Was
generating the 'flat view' multiple times, giving the impression of
'hanging' after crawling large sites using lots of threads. Bug fixed,
and progress bar added.
Version 2.2
released July 2008
Server
response time is logged. This is the time taken between Integrity
sending the request and receiving the first response. This may not
reflect the actual server response time if Integrity is running a large
number of threads, or if the internet connection is busy
When Integrity has finished running, a 'flat' view is available, that can be sorted by any of the columns
Global preferences and current config are now combined into one tabbed window
Standard customisable toolbar added and main window rearranged. Stop is now renamed 'Pause'
Version 2.1
released June 2008 Crawls local files (drag the file into the 'starting URL' box)
Version 2.0 (beta)
Architecture
/ Logic changed. This fixes thread-safety issues (ie v1.x crashing on
faster machines when using larger number of threads). Architecture
change also makes v2 faster. Now handles sites built using frames. Max
number of threads increased. This was limited in version 1.6.6 as a
quick-fix to thread-safety issues. Max number of threads (when slider
is in 'more' position) is now 29, was 7. 'Threads' are no longer really separate threads owned by Integrity, but simultaneous asynchronous requests.
Version 1.6.11
released May 2008
Fixes
bug which was causing some links to be skipped on certain pages.
Integrity's parser was getting confused sometimes by javascript on
pages containing 'less than' and 'greater than' operators. Other small fixes and efficiencies.
Version 1.6.10
released April 2008
Progress indicators added to export functions. Link info window now shows all occurrences of a link alongside the link text for each occurrence.
Version 1.6.9
released April 2008
Fixes bug related to trimming which randomly prevented complete crawling of whole site. Revised handling of incorrectly nested quotes - now correctly allows for apostrophes as part of url ( "/pdf/Educators'_Guide" ). Help menu now links to support pages of peacockmedia.software, 'Donate' menu option added.
Version 1.6.8
released April 2008 Routines for trimming whitespace, querystring etc rewritten in pure C, improving efficiency. Better handling of incorrectly nested single/double quotes ( href = "http://..' ) Now correctly handles base href's which don't give a scheme (assumes http://) Better
trimming of whitespace, ie carriage returns and other control
characters in unexpected places in the middle of <a ..> tags Shows how many times a link occurs, not just how many pages it appears on (ie it may appear multiple times on same page).
Version 1.6.7
released March 2008 Fixes
bug which prevented links being found on a page if the end of a comment
and an 'end script' tag were adjacent to each other (
--></script> )
Version 1.6.6
released March 2008
sends user-agent string in header - default is "integrity/1.6" but this
can be changed (see Preferences) if your site needs integrity to appear
to be a recognised browser. Other fixes and efficiencies.
Version 1.6.5
released November 2007 'whitelists' and 'blacklists' from the config are no longer case-sensitive. some problems with mcms zref fixed. zrefs are now shown when good links are hidden.
links which are not checked because they are in the blacklist, are
treated as good links. They are hidden when good links are hidden and
are given no colour label. "Hide good links" button has now become
"Show bad links only". This subtle change means that links which have
not been checked will not show and improves running. Small fixes and efficiencies.
Version 1.6.4
released October 2007 Fixes problem with tab-delimited file export Both tab-delimited and comma-separated exports are 'flat', ie each 'on page url' has its own row
Fixes crashes or problems caused by carriage returns or whitespace
present within a quoted href (yes, some html has really unexpected
features) Ignores Javascript (anything between <script> tags) More object retention fixes and small efficiencies
Version 1.6.2
released September 2007
'on page url' will now recognise 'https://peacockmedia.software' and
'https://peacockmedia.software/' as the same link. Therefore a broken links
may more correctly be reported on a lower number of pages and the whole
application is a little more efficient. Recognises and reports 'zref' links, a difficult-to-find link inserted by Microsoft Content Management Server other small efficiencies and fixes.
Version 1.6.1
released July 2007 Some changes to improve stability
Version 1.6
released July 2007
Adds user-definable colour labels (see Preferences). A 'good link' is
defined as server response code 2xx, redirected links include any 3xx
code, a bad link is a 4xx code, and an 'error' is a 5xx server code or
any other error. Menu item added View > Info for Current Item
(command-I), shows link inspector pallette (previously only available
via double-click in the main table). Fixes bug causing crash if no internet connection.
Version 1.5
released 28 May 2007 Supports base href. Can now export tab-delimited text file along with CSV, plain text and HTML. Improved HTML export - link urls are presented as links. Adds 'Only follow links containing...' field. Fixes bug allowing some 'commented out' urls to be tested. Fixes bug preventing inspector window opening when some links double-clicked. Preferences window added: allows choice of displaying 'on page' as url or page title. Config Starting URL drop-down list behaviour improved .
Version 1.4.2
released May 21 2007 No longer parses and extracts links from error pages (eg 404 pages). Now handles spaces in URLs (as long as correctly contained in single or double quotes).
Version 1.4
released April 22 2007 Fixes a problem in some earlier versions which prevented all links being found on some pages HTML character entities in links are now 'un-encoded' (eg '&' is replaced with '&') before link is checked. If link appears on more than one page, main table now shows actual number of pages rather than "multiple" 'Re-Check Bad Links' feature added (under File menu) Fixes problem with export to CSV for some sites. NB. early copies of 1.4 give the version number as 1.3.1 in about box.
Version 1.3.1
released April 7 2007 Fixes problem with the 'don't check URLs containing' feature which didn't work properly in v1.3 Fixes problem which caused some links to be missed Small improvement to the stop button
Version 1.3
released April 6 2007 'This page only' checkbox added. Status display more accurately shows number of links done.
Programme flow, thread safety and object retention improvements. Cures
an instability which seemed to be related to websites which have large
collections of external links and/or setting a larger number of threads. Fixes bug preventing some link text from being recorded properly.
For some file types which may be larger files (pdf, mpg, mp3, jpg) the
parser no longer sends an http request to check the 'Content-Type',
speeding up the crawl time.
Version 1.2
released March 29 2007
Now tolerant to excessively long hrefs (previously hrefs over 1000
characters would break an internal limit and cause the application to
crash). Timeout can now be set in the config window. Using a very
large number of threads can obviously make timeouts more likely and so
the timeout figure can now be increased accordingly. The link
inspector window (double-click an entry in the main table) now shows
the 'on page' list in a form which is clickable. A double-click will
open the page in question. The HTML report now shows the 'on page' column as links to the page in question.
Version 1.1
released March 26 2007
Link text shows up for more links - link text is still only held once
regardless of how many instances of that link are found on the site,
but if a link has no text (eg image link), then that will not overwrite
the existing link text. Ignores javascript links as well as mailto links. Fixes bug triggered by a return within the tag. Fixes bug which could prevent all links being found on certain pages.
Version 1.0
released March 25 2007 First non-beta release, free and not set to expire. Not generally released, but provided to 2 magazine coverdiscs.
Version 0.5 (Beta)
released March 22 2007 corrected problem which allowed cached data to be checked - new data is now requested every time. Fixes bug which could prevent some links being found if javascript present in page.
Version 0.4 (Beta)
released March 21 2007 Bug fixed which prevented some relative URLs from being formed correctly
Displays better information about any redirected urls. The final status
code shown is the status for the final (redirected to) URL Link text included as column in main table
Change to programme flow and a number of small refinements and
efficiency improvements meaning that the application remains responsive
throughout larger crawls. Bug fixed which prevented some configs saving properly
Version 0.3 (Beta)
released March 7 2007 Improved interface, added 'Continue' button, allows Integrity to be paused and re-started. Exporting - results can be exported as HTML, CSV or plain text.
Version 0.2 (Beta)
released March 1 2007 Fixes bug preventing Integrity from following links where html is all uppercase.