Identification of the Breach of Short-Term Rental Regulations in Irish Rent Pressure Zones

Abstract

The housing crisis in Ireland has rapidly grown in recent years. To make a more significant profit, many landlords are no longer renting out their houses under long-term tenancies but under short-term tenancies. Regulating rentals in Rent Pressure Zones with the highest and rising rents is becoming a tricky issue. In this paper, we develop a breach identifier to check short-term rentals located in Rent Pressure Zones with potential breaches only using publicly available data from Airbnb (an online marketplace focused on short-term home-stays) and Irish government websites. First, we use a Residual Neural Network to filter out outdoor landscape photos that negatively impact identifying whether an owner has multiple rentals in a Rent Pressure Zone. Second, a Siamese Neural Network is used to compare the similarity of indoor photos to determine if multiple rental posts correspond to the same residence. Next, we use the Haversine algorithm to locate short-term rentals within a circle centered on the coordinate of a permit. Short-term rentals with a permit will not be restricted. Finally, we improve the occupancy estimation model combined with sentiment analysis, which may provide higher accuracy.

Share and Cite:

Liu, G. , Arnedillo-Sánchez, I. and Chen, Z. (2023) Identification of the Breach of Short-Term Rental Regulations in Irish Rent Pressure Zones. Journal of Computer and Communications, 11, 8-19. doi: 10.4236/jcc.2023.112002.

1. Introduction

The housing crisis in Ireland has rapidly grown in recent years. To make a more significant profit, many landlords are no longer renting out their houses under long-term tenancies but under short-term tenancies. Under Irish regulations, Short-term Tenancies are defined as those for less than 14 days. According to Airdna [1] and Residential Tenancies Board [2] , the average monthly revenue from short-term rentals is greater than the revenue from long-term rentals, which has resulted in the number of available long-term rentals continuing to decline while lucrative short-term rentals are becoming more popular.

The shift from long-term to short-term rentals has harmed the supply of private housing rentals. During the housing crisis, the number of houses used for tourism is higher than those used for private rentals in every county except Dublin. According to Inside Airbnb [3] , the number of rental properties on Airbnb1 in the Irish capital alone has increased from around 1700 in 2016 to more than 7064 in 2022, 97.2% of which are short-term rentals. This is even more serious in those popular tourist counties.

As a ripple effect, rental prices have increased significantly. According to Daft.ie [4] , the rental market in Ireland has hit new highs as the number of available homes has fallen to historically low levels. Rents in the second quarter of 2022 rose by an average of 12.6% compared to the same period in 2021. This is the largest increase since Daft.ie began reporting in 2006 and an all-time low in the number of homes available for rent, with the scarcity of available homes in Ireland now unprecedented. The supply of rental properties has fallen by a staggering 97% compared to 2009. Only 716 homes were available for rent nationwide on August 1, down from nearly 2500 a year ago and a record low since 2006.

The government has approved measures to tighten regulation of the short-term rental market. Under the new regulations, short-term rental platforms such as Airbnb and landlords will be fined if they advertise non-compliant properties from September 1, 2022 [5] . The regulations target areas where the accommodation shortage is most acute, requiring Airbnb to obtain planning permission before advertising on their platforms in Rent Pressure Zones (see Section. 2.1). In 2021, 286 short-term rentals were advertised on Airbnb in Cork. In the same year, Cork Council launched 94 investigations and issued 68 warning letters. However, there are still 11 counties that have not taken any action against these short-term rental owners on the popular website to check if they are in violation, making it a difficult issue to regulate short-term rentals in Rent Pressure Zones [6] . The Office of the Planning Regulator previously stated that regulating short-term rentals in Rent Pressure Zones is becoming tricky. On August 2022, the Irish government has not yet provided an efficient way to investigate it, which motivated us to write this paper.

This paper demonstrates how to identify short-term rentals located in Rent Pressure Zones with potential violations using only publicly available data from Airbnb.

1) We use a Residual Neural Network to identify and filter out outdoor landscape photos. These noisy photos negatively impact identifying whether an owner has multiple short-term rentals in a Rent Pressure Zone.

2) A Siamese Neural Network is used to compare the similarity of images to determine if the interior regions of any two houses are highly similar. There are cases in Airbnb where the same home is posted multiple times or different rooms in the same home are posted multiple times.

3) We use the Haversine algorithm to locate short-term rentals within a circle with a radius of 150 meters, centered on the coordinate of a permit. Short-term rentals with a permit will not be restricted.

4) We estimate the occupancy of a house by its user reviews combined with sentiment analysis.

2. Background

2.1. Rent Pressure Zones

A Rent Pressure Zone is a designated area in Ireland where rents cannot exceed the general inflation rate, as recorded by the Harmonized Index of Consumer Prices, or 2% per year pro rata. This applies to both new and existing tenancies (unless an exemption is being applied for). Rent Pressure Zones are located in the parts of the country with the highest and rising rents and where households have the most difficulty finding affordable accommodation [7] . They are designed to moderate rent increases in these areas and create a stable and sustainable rental market, allowing owners and tenants to plan financially for their future.

For areas to be designated as Rent Pressure Zones, the following criteria are used:

• The standardized average rent for the previous quarter must be higher than the appropriate reference standardized average rent for that quarter.

• Annual rental inflation in the area must have been 7% or higher in four of the last six quarters.

Ireland now has three different standardized average rents, and the area’s location determines the assessment of which method is used. Dublin is now compared to the national standardized average rent. Kildare, Meath and Wicklow are now compared to the national standardized rent, excluding Dublin. The other areas are now compared to the standardized average rent outside the Greater Dublin Area. On July 2022, there are 48 areas set as Rent Pressure Zones in six local authorities, as shown in Figure 1.

2.2. Planning Permission

According to the new regulation for short-term rentals [9] , if an owner rents out an entire house or one of its rooms on a short-term basis, they may need to apply for planning permission [10] from the local authority to use it for short-term rentals. The requirement for planning permission only applies to owners in Rent Pressure Zones. Once planning permission is granted, they can use the entire principal residence for short-term rentals of more than 90 days or rent out a second house for short-term rentals when they are not at home.

Figure 1. The map of Irish Rent Pressure Zones (2022) [8] .

3. Data Collection

3.1. Airbnb House and Owner Data

The publicly accessible home data (Figure 2) and owner data (Figure 3) were obtained using Airbnb APIs, including home type, reviews, home photos, the total number of an owner’s homes, etc. No private information was used.

An important issue is that those home locations were randomized by Airbnb. In detail, the public geographic coordinate of a house in Airbnb data is a random location within a circle with a radius of 150 meters, centered on its actual geographic coordinate. So even if two rooms come from the same building, it is impossible to distinguish them directly.

3.2. Geographic Information of Rent Pressure Zones

The Local Electoral Area Dataset [11] contains geographical features, as shown in Figure 4, which helps identify whether a house is located within a Rent Pressure Zone.

Figure 2. An Airbnb house entry.

Figure 3. An Airbnb owner entry.

Figure 4. A local electoral area entry.

3.3. National Planning Application Dataset

The National Planning Application Dataset [12] provides details on all planning applications in Ireland and contains polygon coordinates of housing applications and descriptions of permission applications, as shown in Figure 5. This dataset was first filtered to obtain all entries related to short-term rental applications.

4. Data Preprocessing

4.1. Locating the Centroid of House

The National Planning Application Dataset uses polygon coordinates to record the geographic location of a house. The centroid coordinate was calculated according to [13] , and used to uniquely identify a house’s location during this stage, as shown in Figure 6.

4.2. Filtering Houses in Rent Pressure Zones

The PNPOLY algorithm filtered out houses not located in a Rent Pressure Zone. This algorithm was proposed by [14] to solve the problem of determining whether a coordinate is inside a polygon for a geographic information management system.

Figure 5. A national planning application entry.

Figure 6. The centroid of a house.

5. Methodology

The breach identifier contains three components:

• A Principal Residence Identifier containing two components:

- An image classifier for indoor and outdoor. It deletes noisy outdoor photos before image similarity detection.

- An image similarity detector used to determine whether photos from different posts come from the same residence or room.

• A Permit Finder. Under the rules, it is necessary to determine whether a house has planning permission if it is not an owner’s principal residence or its cumulative rental period within one year is more than 90 days. The permit finder checks if a house is within the scope of a valid short-term rental permit.

• An Occupancy Estimator. The rules require that when an owner entirely rents out their principal residence, it cannot exceed 90 days. Due to trade secrecy issues, Airbnb does not disclose the occupancy time of a home, so the only way to estimate it is by the number of reviews.

5.1. Principal Residence Identifier

The principal residence identifier is based on the rules for short-term rentals within Rent Pressure Zones. If a short-term rented house is not an owner’s principal residence, they need to obtain the relevant planning permission to rent it out legally [9] . The flow chart is shown in Figure 7, where an owner’s all house photos are filtered through an image classifier based on a Residual Neural Network to remove outdoor photos, then a Siamese Neural Network is used to compare the similarity of indoor photos to remove duplicate houses. Unlike a model that learns to classify inputs, a Siamese Neural Network learns how to differentiate between two inputs by learning their similarity [15] . It shows better performance in situations involving the discovery of similarity or relationship between two comparable things.

5.1.1. Image Classification for Indoor and Outdoor

Some posts on Airbnb contain many outdoor photos, such as beautiful landscapes, because owners want to use them to attract more visitors. These noisy data can affect the subsequent identification of whether multiple short-term rentals correspond to the same house posted by an owner. For example, two posts contain the same landscape photos in Figure 8, but come from different houses. So it is required to remove these noisy images. The image classifier uses a Residual Neural Network ResNet-18 [16] to classify whether an image is indoor or outdoor and delete outdoor photos.

5.1.2. Image Similarity Detection

There are two legitimate scenarios in which the breach identifier may incorrectly assume that an owner has multiple homes in a Rent Pressure Zone, which is not allowed.

• Owners can rent out multiple rooms in their principal residence without restriction [9] . They can post two or more rooms in their principal residence on Airbnb.

• Owners may post the same room multiple times on Airbnb to give their home more exposure. In Figure 9, these two posts correspond to the same room, but the photos were taken from different angles.

Figure 7. The principal residence identifier.

Figure 8. Two posts have the same outdoor photo.

Figure 9. Two posts correspond to the same room.

To address these two issues, we should check photos from different posts to see whether they come from the same principal residence or the same room. If a room is posted multiple times, there must be many similar or the same photos. If multiple rooms are from the same residence, their photos probably contain identical common areas such as the kitchen, hallway and bathroom. For example, three posts in Figure 10 are from different rooms in the same house. They have identical common areas with a high degree of similarity.

In most cases, two images will not be fully identical even if they have the same scene or object due to differences in size, scale, rotation and skew. So the comparison cannot be made directly using pixel-to-pixel methods. The problem shifted from identifying pixel similarity to object similarity.

The sentence-transformers library was used in this project to provide a way to compute a dense vector representation of an image and find identical images [17] . It encodes all images into vector space and finds high-density regions corresponding to areas where the images are similar. For example, when using image search, a photo is entered and converted into a set of vectors. Sentence-transformers compare it with other images in the database and find the image with the highest similarity (the closest distance). When two photos from one post have greater than 95% similarity to the photos from another post, or one photo has 100% similarity to another, two posts are considered to be associated with the same room. So the breach identifier will not assume its owner has violated the rules.

5.2. Permit Finder

As the actual geographic coordinate of a house has been anonymized by Airbnb, replaced with a random location within a circle with a radius of 150 meters as Figure 11, it is impossible to know whether a permit is available for a short-term rental house by conventional coordinate comparison. We use the Haversine algorithm [18] to determine whether a house is within a circle with a radius of 150 meters, centered on the location of any known permit. By using the longitude and latitude of two locations, the Haversine algorithm can calculate their distances on the surface of the Earth. Because of the anonymization issue, when there are multiple houses around a permit, they will all be considered to have the same available permit.

Figure 10. Three posts correspond to the same house.

Figure 11. The anonymous geographic coordinate of a house.

5.3. Occupancy Estimator

The occupancy estimator is based on the rule that, in some cases, short-term rentals are not allowed to accumulate more than 90 days within a year [9] . One of the biggest issues with Airbnb is whether owners rent residential properties as hotels permanently rather than occasionally sharing. On Airbnb, the occupancy time of a house is unpublished data. However, it can be estimated by the number of user reviews.

In [19] , Doris Chenguang Wu et al. found that the sentiment of a hotel’s reviews affected its occupancy. Inside Airbnb uses an occupancy estimation model named the San Francisco model [20] to estimate how often an Airbnb house is being rented out (Equation (1)) and approximate its income. The review rate in Equation (1) can be different numbers. Inside Airbnb set it to 50%, meaning half of the tenants write reviews about houses. For the average number of nights of stay, Airbnb reported that it was 5.5 nights in San Francisco. If the average number of nights in a city is unknown, then three nights should be used. Lastly, the minimum number of nights of stay can be set for a house by its owner.

Occupancy = NumberofReviews ReviewRate × max ( AvgNights , MinNights ) (1)

In this project, an improvement for the San Francisco model was proposed. Instead of a fixed review conversion rate, the improved model uses a rate of 50% as a base and has a sentiment bias of 10% increment or reduction depending on the sentiment score of house reviews. Better reviews lead to more occupancy time. Our improved formula is Equation (2), where the average number of nights of stay for guests is 4.6, taken from public Airbnb data, and the minimum number of nights of stay can be set for a house by its owner. Finally, the review rate is set as 50%.

Occupancy = NumberofReviews ReviewRate + SentimentBias × max ( AvgNights , MinNights ) (2)

Equation (2) has an important variable called Sentiment Bias. The flow chart of its calculation is shown in Figure 12. TextBlob, a Python library for natural language processing [21] , is introduced into the calculation. TextBlob mainly uses pre-trained built-in classifiers to perform sentiment analysis and can obtain a corresponding sentiment score for a review.

1) The reviews of a house are filtered to keep only the reviews within one past year.

2) Reviews may be written in non-English languages, and DeepL API2 is used to translate those reviews to English automatically.

3) The TextBlob analyzer calculates an average sentiment score for all reviews. A sentiment score is between [−1, 1] where −1 defines extremely negative emotion and 1 defines extremely positive emotion.

4) With the sentiment score, we defined a sentiment bias as Equation (3).

SentimentBias = SentimentScore × 0.1 (3)

6. Result and Evaluation

There were 1713 short-term rentals located in Rent Pressure Zones in the collected data. 845 of them were identified as having potential breaches, and 21 principal houses with occupancy greater than 70 days were about to breach the rule. The overall breach rate was 50.67%. The reality can be worse, as a permit only works for one house in the actual situation. However, the anonymous coordinates of houses may cause several houses to be simultaneously located within an available permit, resulting in a situation where one permit is used for multiple houses.

Figure 12. The calculation of sentiment bias.

Because Airbnb does not disclose accurate house coordinates and occupancy data, individuals cannot verify the accuracy of the breach identifier. The accuracy of the occupancy estimator cannot be verified either. It only provides an estimate within a reasonable range. Tenants should be skeptical of short-term rentals that are flagged as possible breaches.

7. Conclusions

In summary, this project achieved the following contributions and benefits to the current regulation of the Irish short-term rental market:

• It provided a viable method to identify potential short-term rentals that may be in violation and their owners when no accurate data is available.

• It provided a more efficient way to monitor the short-term rental market rather than relying on traditional user reporting.

• It provided an image identifier that can determine whether multiple rental posts correspond to the same room or residence by calculating the similarity of room photos. This model may be applied to other areas. For instance, some second-hand goods websites may prohibit users from posting duplicate items.

• It improved the occupancy estimation model combined with sentiment analysis, which may provide higher accuracy. The improved model may also be used for hotel occupancy estimation, etc.

NOTES

1https://www.airbnb.ie

2https://www.deepl.com/pro-api

Conflicts of Interest

The authors declare no conflicts of interest regarding the publication of this paper.

References

[1] AirDNA. AirDNA Vacation Rental Data.
https://www.airdna.co/vacation-rental-data/app/ie/default/dublin/overview
[2] Residential Tenancies Board (2022) Residential Tenancies Board Publishes Q1 2022 Rent Index. Tech. Rep.
[3] Cox, M., Morris, J. and Higgins, T. (2022) Inside Airbnb: Dublin.
http://insideairbnb.com/dublin
[4] Daft.ie (2022) The Daft.ie Rental Price Report. Tech. Rep.
[5] Citizens Information (2022) Renting Your Property for Short-Term Lets.
https://www.citizensinformation.ie/en/housing/owning_a_home/home_owners/renting_your_property_for_shortterm_lets.html
[6] Gataveckaite, G. (2022) Revealed: The Counties That Are Clamping Down on Airbnb Rentals... and the Ones That Aren’t.
https://www.independent.ie/irish-news/revealed-the-counties-that-are-clamping-down-on-airbnb-rentals-and-the-ones-that-arent-41595034.html
[7] Government of Ireland (2021) The Residential Tenancies (Amendment) Act 2021. Tech. Rep.
[8] Department of Finance, Government of Ireland (2021) Housing and Property Sector Chartpack. Tech. Rep.
[9] Department of Housing, Local Government and Heritage, Government of Ireland (2022) Planning and Development Regulations 2001-2022. Tech. Rep.
[10] Citizens Information (2022) Planning Permission.
https://www.citizensinformation.ie/en/housing/planning_permission/planning_permission_general.html
[11] Ordnance Survey Ireland (2019) Local Electoral Areas—OSi National Statutory Boundaries-2019.
https://data.gov.ie/en_GB/dataset/local-electoral-areas-osi-national-statutory-boundaries-2019
[12] Department of Housing, Local Government and Heritage, Government of Ireland (2022) National Planning Applications.
https://data.gov.ie/dataset/national-planning-applications
[13] Bashein, G. and Detmer, P.R. (1994) Centroid of a Polygon. 1em plus 0.5em minus 0.4em USA. Academic Press Professional, Inc., Cambridge, 3-6.
https://doi.org/10.1016/B978-0-12-336156-1.50010-0
[14] Franklin, W.R. (2022) Pnpoly-Point Inclusion in Polygon Test.
https://wrfranklin.org/Research/Short_Notes/pnpoly.html
[15] Chicco, D. (2021) Siamese Neural Networks: An Overview. In: Cartwright, H., Ed., Artificial Neural Networks, Springer, Berlin, 73-94.
https://doi.org/10.1007/978-1-0716-0826-5_3
[16] He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 770-778.
https://doi.org/10.1109/CVPR.2016.90
[17] Reimers, N. and Gurevych, I. (2019) Sentence-BERT: Sentence Embeddings Using Siamese BERT-Networks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Pro-cessing (EMNLP-IJCNLP), Hong Kong, November 2019, 3982-3992.
https://doi.org/10.18653/v1/D19-1410
[18] Robusto, C.C. (1957) The Cosine-Haversine Formula. The American Mathematical Monthly, 64, 38-40.
https://doi.org/10.2307/2309088
[19] Wu, D.C., Zhong, S., Qiu, R.T. and Wu, J. (2022) Are Customer Reviews Just Reviews? Hotel Forecasting Using Sentiment Analysis. Tourism Economics, 28, 795-816.
https://doi.org/10.1177/13548166211049865
[20] Inside Airbnb (2022) Inside Airbnb: Data Assumptions.
http://insideairbnb.com/data-assumptions
[21] Loria, S. (2022) Text-Blob: Simplified Text Processing.
https://textblob.readthedocs.io/en/dev

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.