A semi-duplicate is an article or other page that has a significant overlap with another article or other page or is otherwise a topic that would expect to be combined in 1 article. A semi-duplicate is distinct from a full duplicate in that a full duplicate is on exactly the same thing such as petrol/gasoline where the different names refer to exactly the same thing. Generally full duplicates are created accidentally when people don't realize that an article already exists while a semi-duplicate is usually created deliberately, that is to say they think a distinction should be made even though it probably shouldn't. Lsjbot created many semi-duplicates on the Cebuano and Swedish Wikipedias for example creating separate articles for municipalities in Italy with the same name as a settlement even though it is normal for such topics to be combined into a single article. If such articles do exist they tend to become content forks of each other however the term content fork can refer to a descriptive title that is split from the main article to represent a particular point of view (otherwise it may be argued this page is itself a semi-duplicate). Although Wikipedia is not a dictionary it is usually desirable to combine certain topics that have the same name and overlap.

Examples edit

Dealing with articles edit

If only a small number of separate articles exist then boldly merging or tagging them for merging is probably appropriate. If a large number of articles exist you should probably discuss at a Wikiproject or a RFC to decide if articles should make a distinction.

If you merge a complete duplicate the Wikidata items should normally also be merged, see d:Help:Merge however Wikidata may make a distinction for what we would not normally so care must be taken if its also standard on Wikidata.

Difference between semi-duplicates and other problems edit

The difference between content forking and semi-duplicates is, the content that is being forked that shouldn't be in a particular article, such as if there was a significant amount of information about apples in the Orange (fruit) article the correct thing would be to move the content about apples into the Apple article rather than merge Orange (fruit) to Apple. On the other hand there may be separate articles for a settlement and low level municipality even though the separate article for the municipality does not duplicate much content in the settlement article for example it would still be a semi-duplicate. Generally per WP:BEFORE C1 if there are problems with content forking they should be dealt with through normal editing rather than merging the articles. Similarly, just because the articles don't have content forking problems due to good editing doesn't mean the articles should be kept separate, similar to Wikipedia:No amount of editing can overcome a lack of notability. However, semi-duplicates have a higher risk of content forking though the article may be less likely to be merged if distinct content has been added that may justify an exception.

The difference between notability and semi-duplicates is, notability normally concerns substantially distinct topics, such as a settlement vs. primary school, where the primary school may be merged with the settlement because it's not notable enough for a standalone article. If separate articles exist for the settlement and primary school it's unlikely they would duplicate much but the school may not pass the notability guidelines. On the other hand, with a settlement v s.low level municipality the municipality would clearly pass the notability criteria as with the settlement but such topics overlap too much to have different articles.