Introduction

In recent times, the assessments of land use and land cover (LULC) changes have been used in the monitoring and management of ecological changes in different parts of the world1. Land use refers to the different human activities on the land which result in changes in the vegetation structure, water bodies, soil, rocks and other natural resources of an area2. Having accurate knowledge of the LULC of a place enhances the proper management of the challenges associated with the land. Apart from this, knowledge of the past present and future LULC changes also enables an appropriate estimation of the socio-economic and environmental impacts of such changes3. These LULC changes are direct products of increased global human activities and urbanization4,5.

Human activities are more impactful on the vegetation cover of terrestrial ecosystems thereby leading to environmental changes at local, regional and global levels6. These environmental changes also include an increase in the surface temperature due to the transformation of vegetation covers to other land use forms such as bare surfaces, solid surfaces, and agricultural lands7,8. It has been predicted that the land surface temperature of most parts of the world, especially developing countries will increase geometrically due to the impacts of pollution and urbanization in the year 20509. Among other factors, population increase and uncontrolled and improper management of changes in LULC of urban areas are contributors to global climate change which increased surface temperatures. Hence, the assessment of LULC changes in an area will enable the understanding of the degree and spatial extent of anthropogenic changes there6.

The land surface temperature (LST) has been described as very essential in the assessment of the earth's surface features including the LULC and others10. Many studies have proved the influence of LULC on the LST of different parts of the world using remote sensing techniques6,7,8. Satellite remote sensing and geographic information systems are viable tools used for investigating the intensity of human impacts on the ecosystems through the mapping of LULC changes and vegetation indices within a stipulated period11. Remote sensing makes it easier and more economical to access data on vegetation and LULC changes in an area at a specific time12. These spatial data can then be managed and analyzed accurately using GIS techniques13. Landsat sensors such as Landsat 5 Thematic Mapper (TM), Landsat 7 Enhanced Thematic Mapper (ETM) and Landsat 8 Operational Land Imager (OLI) have been used to assess LULC and vegetation indices across the world14.

Studying the LST of an area could supply useful information on the human survival of such an area15. It could also provide information on the survival of crops because extreme climatic conditions negatively influence the growth, survival and productivity of crops16. Landsat data such as Landsat 8 has been very instrumental to the study of LST at both local and regional scales17. On the other hand, the normalized difference vegetation index (NDVI) has been used to measure the presence and dynamics of vegetation such as the green leaf area index, vegetation cover, green biomass and vegetation productivity18. It indicates the vegetation condition and predicts the productivity of plants in several areas of the world19. The NDVI works on the principle of electromagnetic radiation in which the greenness portion of the vegetation shows less reflectance in the visible spectrum because of the absorption of photosynthetic pigments. Consequently, it has a maximum reflectance in the near-infrared region20. In this study, the impact of LULC changes on the LST and vegetation of Penang Island between the years 2010 and 2021 was examined.

Penang Island was chosen considering that it houses the UNESCO world heritage centre (Georgetown). Georgetown has a rich and diverse cultural heritage; hence it was enlisted as a world heritage centre in 200821. Over the years, this Island has witnessed lots of developmental transformations due to rapid industrialization and other anthropogenic influences22. It has been said that studying past events in the climate and land use change of an area will enhance proper deductions of the effects of such factors in the future23. Consequently, it became highly imperative for an assessment of the impacts of these anthropogenic changes on the vegetation of the famous city. Hence, the specific objectives include assessing the LULC changes in Penang over the 11 years; assessing the changes in LST and NDVI and; assessing the relationship between the LST and NDVI with regard to the LULC classes.

Results

The overall accuracy and Kappa coefficient for the year 2010 are 85.76% and 0.85 respectively while those for the year 2021 include 88.53% and 0.89 respectively (Table 1).

Table 1 Accuracy assessment of LULC classification of Penang Island.

Regarding the producer’s accuracy (2010 and 2021), all the LULC classes were greater than 70%. Similarly, the user’s accuracies for all the LULC classes also exceeded 70%. This indicates that the classification of the LULC classes was achieved with very high accuracy. The results of the LULC classification of Penang Island (Table 2) indicated that forests have the highest area of 12025.44 ha (39.17%) in 2010 (Fig. 1) and 12213.70 ha (39.78%) in the year 2021 (Fig. 2) whereas the water bodies were found to have the lowest area of land (189.12 ha in 2010 and 385.27 ha in 2021).

Table 2 LULC area of Penang Island (2010–2021).
Figure 1
figure 1

The LULC map of Penang Island in 2010 (created by the authors using ArcMap 10.8 software).

Figure 2
figure 2

The LULC map of Penang Island in 2021 (created by the authors using ArcMap 10.8 software).

The analysis of the rate of change reveals a drastic increase in the urbanized areas which had a 45.71% increase over the 11 years of observation (Table 3). During this period, the urbanized area increased from 6111.62 to 8905.43 ha. This amounts to an annual increase rate of 4.16%. Also, the forest lands experienced a slight increase of 1.57% during this period. The other LULC classes with increasing land areas include bare lands (257.56%) and water bodies (12.93%). On the contrary, the agricultural land had a decrease of − 33.49% while the rocks had a decrease of − 51.65%.

Table 3 The annual rate of change in the LULC of Penang Island (2010–2021).

LULC impact on LST

The satellite images studied have been characterized for the land surface temperature regarding the LULC classes. The average LST values as influenced by the LULC are presented in Table 4.

Table 4 Average LST (°C) and NDVI for different LULC classes.

In the year 2010, the bare lands and urbanized areas have the highest LST of 29.76 °C and 28.75 °C respectively (Fig. 3). Also in 2021, the urbanized areas exhibit the highest LST of 34.0 °C (Fig. 4). The forest lands have the lowest LST (23.60 °C) in 2010 which increased to 31.4 °C in 2021. Furthermore, the forest and agricultural lands were found to have the highest NDVI of 0.53 and 0.36 respectively in 2010 (Table 4, Fig. 5). This same trend was observed in the year 2021 where forest and agricultural lands had the highest NDVI (Fig. 6). Besides, the same LULC classes with lower NDVI values in 2010 still had lower NDVI in 2021.

Figure 3
figure 3

LST of Penang Island in 2010 (created by the authors using ArcMap 10.8 software).

Figure 4
figure 4

LST of Penang Island in 2021 (created by the authors using ArcMap 10.8 software).

Figure 5
figure 5

NDVI of Penang Island in 2010 (created by the authors using ArcMap 10.8 software).

Figure 6
figure 6

NDVI of Penang Island in 2021 (created by the authors using ArcMap 10.8 software).

Relationship between LST and NDVI across LULC types

The relationship between the LST and NDVI is shown in Table 5. As revealed by the linear regression analysis, the LST had negative correlations with the NDVI values of forests, agricultural lands and bare lands in the year 2010.

Table 5 The relationship between LST and NDVI.

In 2010, water bodies have the highest positive correlation between LST and NDVI (R2 = 0.91), followed by the urbanized areas (R2 = 0.53). The strongest negative correlation between LST and NDVI was observed in the agricultural lands (R2 = 0.32) and followed by the forest lands (R2 = 0.14). Interestingly, the forests and agricultural lands were discovered to have the lowest LST (23.6 °C and 24.63 °C respectively) and the highest NDVI values (0.21 each) in 2010. Also, a similar trend was observed in 2021 where LST exhibited strong negative correlations with NDVI in agricultural lands (R2 = 0.24) and forests (R2 = 0.20). The strongest positive correlation between the LST and NDVI was observed in the urbanized areas (R2 = 0.92), followed by the bare lands (R2 = 0.86).

Discussion

The use of a remote sensing approach in the assessment of the impacts of LULC changes on the LST and vegetation cover of an area is beneficial in enhancing appropriate land management decisions9. In this study, the agricultural (herbaceous) land and rocks experienced a drastic decrease in the land areas during these 11 years. The loss of agricultural lands and rocks could have been gained by the urbanized areas. A similar study revealed that Penang has a track record of increased urbanization due to the vision of the State Government to make the State a renowned World Trade Centre24. This has led to a great influx of investors into the State, hence leading to the industrialization of the State. This Penang Island has already been described as the most developed part of the State comprising the international airport, factories, and many residential buildings22. Another study on land use and land cover changes in the capital city of Malaysia (Kuala Lumpur) revealed the same trend of rapid loss of green areas to urbanized areas over the last 15 years9. This high rate of urbanization did not only affect the vegetation cover of the city but also increased the pollution impacts on the city.

Landsat data are known to be very useful in understanding the impacts of LULC on the LST of an area9. The increase in the LST of this study area is similar to the previous work which showed that the urbanized areas of Penang Island have the highest LST in 1999 and 200724. A similar observation was recorded in other cities in which the urban areas had the highest LST10,25. This means that the change in the urbanization and bare lands of the Island has influenced the LST. This is also caused by the loss in the vegetation cover of the agricultural or herbaceous lands to materials such as concretes, stones and tars used for urbanization. The lower LST values exhibited by forests and agricultural lands are attributed to their contributions to the photosynthetic pool of the area, thereby reducing the heat24. Therefore, urbanization involving buildings incorporated with vegetation (green buildings) and less concrete structures has been suggested as a way of reducing the LST of an area26,27.

It has been reported that the strength of the correlation is revealed by the linear regression coefficient28. The negative correlations observed between the LST and NDVI at the LULC classes indicate that the higher the surface temperature, the lower the vegetation cover or biomass of those LULC types24 and vice versa. Areas with high NDVI have been described as having enough vegetation cover which produces cooling effects thereby reducing the surface temperature22.

Conclusion

From this study, it has been revealed that Penang Island had a considerable increase in the urbanized areas and bare lands coupled with a greater loss in the green areas (particularly the agricultural/herbaceous lands). The forests in this city only had a slight increase in land area. This is somehow commendable as the city was able to maintain its forest lands despite the rapid urbanization. However, the loss of agricultural or herbaceous lands to urbanization is also worrisome. This is because the agricultural/herbaceous lands also had roles to play in ensuring the maintenance of the vegetation cover / photosynthetic productivity of the city. Also, urbanization can be inferred as the main driver of the changes in the LULC of Penang Island. Due to the importance of urbanization, there is a need for an appropriate policy in managing land use to ensure balance in the transformation of LULC classes. This study hereby recommends that policymakers should ensure that the greenness of the city is maintained through the regulation of urban development.

Methods

Study area

Penang is situated in the northern part of Peninsular Malaysia and lies within the latitudes 5°12'N to 5°30' N and longitudes 100°09'E to 100°26'E (Fig. 7). Penang with a land area of 295 Km2, has an estimated population of 720,000 and is regarded as the most populated island in Malaysia. Penang shares the same border on the north and east with Kedah State and the south with Perak State. There are two main parts of Penang State: Penang Island and the mainland which is also regarded as Seberang Perai. These two parts of the State are connected by the two bridges. The eastern part of Penang Island is the most urbanized area comprising industries, commercial centres and residential buildings. However, the western part is less developed comprising mainly hilly terrain and forests22. This study is focused on the Island part of Penang. This island is endowed with a yearly equatorial climate (hot and humid). It has a mean annual temperature ranging between 27 and 30 °C while the mean annual relative humidity ranges between 70 and 90%. Also, the mean annual rainfall is about 267–624 cm.

Figure 7
figure 7

The map of Penang State showing the Penang Island (created by the authors using ArcMap 10.8 software).

Data acquisition

The flow chart of the methodology is presented in Fig. 8. Landsat satellite images were used for the assessment of changes in land use covering a period of 2010–2021 (11 years).

Figure 8
figure 8

The flow chart of the methodology.

These images were gotten from the website of the United State Geological Survey (https://earthexplorer.usgs.gov). The Landsat images include the Landsat 5 TM (thematic mapper) and Landsat 8 OLI / TIRS (operational land imager / thermal infrared sensor). These were downloaded from the Landsat level 1 dataset (Table 6) with additional criteria which reduced the.

Table 6 The characteristics of the satellite data used.

Determination of LST and NDVI for Landsat 5 and 8

Band 6 of Landsat 5 and band 10 of Landsat 8 were used for the determination of the land surface temperature (LST). The LST and normalized difference vegetation index were determined using the following steps:

Conversion of top of atmosphere (TOA) radiance

Using the radiance rescaling factor, thermal infra-red digital numbers were converted to TOA spectral radiance using the equation below29: \(\frac{Red - NIR}{{Red + NIR}}\) \(\frac{Red - NIR}{{Red + NIR}}\) For Landsat 8,

$$ {\text{L}}\lambda = \left( {{\text{ML}} \times {\text{ Qcal}}} \right) + \left( {{\text{AL}} - {\text{Oi}}} \right) $$
(1)

For Landsat 5,

$$ {\text{L}}\lambda = \left( {{ }\frac{{{\text{LMax}}\lambda - {\text{LMin}}\lambda }}{{{\text{QcalMax}} - {\text{QcalMin}}}}} \right) \times \left( {\left( {{\text{Qcal }} - {\text{QcalMin}}} \right) + {\text{LMin}}\lambda } \right) $$
(2)

where Lλ is TOA spectral radiance, ML is radiance multiplicative band Number, AL is radiance add band number, Qcal is quantized and calibrated standard product pixel values (DN for band 6 or band 10), Oi is the correction value for the respective bands, LMaxλ is spectral radiance scaled to QcalMax, LMinλ is spectral radiance scaled to QcalMin, QcalMax is maximum quantized calibrated pixel value, and QcalMin is minimum quantized calibrated pixel value.

Conversion to TOA brightness temperature (BT)

Spectral radiance data were converted to TOA brightness temperature using the thermal constant values in the Metadata file29.

Kelvin (K) to Celcius (°C) degrees

$$ BT = {\raise0.5ex\hbox{$\scriptstyle {K2}$} \kern-0.1em/\kern-0.15em \lower0.25ex\hbox{$\scriptstyle {\ln \left( {\frac{K1}{{{\text{L}}\lambda { } + { }1}}} \right)}$}} - 273.15 $$
(3)

where BT is the Top of atmosphere brightness temperature (°C), Lλ is TOA spectral radiance (W.m−2 .sr−1 .µm−1)), K1 is the K1 constant band number, and K2 is the K2 constant band number. For Landsat 5, K1 is 607.76, and K2 is 1260.56.

Normalized difference vegetation index (NDVI)

The Normalized Difference Vegetation Index (NDVI) is a standardized vegetation index which reveals the intensity of greenness and surface radiant temperature of the area30,31. The index value of NDVI usually ranges from − 1 to 1. The higher NDVI value indicates that the vegetation of the area is denser and healthier. This shows that the NDVI values of normal healthy vegetation range from 0.1– 0.75, while it is almost zero for rock and soil, and negative value for water bodies24. The NDVI is calculated using the followings:

$$ {\text{NDVI }} = \frac{{\left( {{\text{NIR }}{-}{\text{ RED}}} \right){ }}}{{\left( {{\text{NIR }} + {\text{ RED}}} \right)}} $$
(4)

In Landsat 4–7

$$ {\text{NDVI }} = \, \left( {{\text{Band 4 }}{-}{\text{ Band 3}}} \right) \, / \, \left( {{\text{Band 4 }} + {\text{ Band 3}}} \right) $$

In Landsat 8

$$ {\text{NDVI }} = \, \left( {{\text{Band 5 }}{-}{\text{ Band 4}}} \right) \, / \, \left( {{\text{Band 5 }} + {\text{ Band 4}}} \right) $$

where: RED = DN values from the RED band, and NIR = DN values from the Near Infra-red band.

Land Surface Emissivity (LSE)

Land Surface Emissivity is the average emissivity of an element on the surface of the earth calculated from NDVI values.

$$ {\text{PV }} = \left\{ {\frac{{\left( {{\text{NDVI }} - {\text{ NDVImin}}} \right)}}{{\left( {{\text{NDVImax }} - {\text{ NDVImin}}} \right)}}} \right\}^{2} $$
(5)

where PV is the Proportion of vegetation, NDVI is the DN value from the NDVI image, NDVImin is the minimum DN value from the NDVI image, and NDVImax is the maximum DN value from the NDVI image.

$$ {\text{E }} = \left( {0.004{ } \times {\text{PV}}} \right) + 0.986 $$
(6)

where E is land surface emissivity, PV is the Proportion of vegetation, 0.986 corresponds to a correction value of the equation.

Land Surface Temperature (LST)

Land Surface Temperature (LST) is the radiative temperature which is calculated using top of atmosphere brightness temperature, the wavelength of emitted radiance and land surface emissivity.

$$ {\text{LST}} = {\raise0.5ex\hbox{$\scriptstyle {BT}$} \kern-0.1em/\kern-0.15em \lower0.25ex\hbox{$\scriptstyle {\left( {1 + \left( {{\lambda } \times { }\frac{{{\text{BT}}}}{{{\text{c}}2}}} \right) \times {\text{ln}}\left( {\text{E}} \right)} \right)}$}} $$
(7)

Here c2 is 14388. The value of λ for Landsat 5 (Band 6) is 11.5 µm and Landsat 8 (Band 10) is 10.8 µm.

Where BT is the top of atmosphere brightness temperature, λ is the wavelength of emitted radiance, and E is land surface emissivity.

c2 = h*c/s (1.4388*10–2 mK = 14388 mK), h is Planck’s constant (6.626*1034 Js), s is Boltzmann constant (1.38*1023 JK), c is velocity of light (2.998*108 m/s).

Determination of land use and land cover (LULC) of the study area

The Landsat images were pre-screened and subjected to clipping and classification32. The boundary shape file of Penang was used to clip out the area of study.

Image classification

The unsupervised method involving a random assignment of sample training points and supervised methods of satellite image classification was employed in this study for determining the LULC types. This mixture of image classification methods has been reported as vital in achieving a high accuracy level33. Bands 5, 4 and 3 were used to classify Landsat 8 while bands 4, 3 and 2 were used for classifying Landsat 5. We used the extraction by mask in the spatial analyst tool of ArcMap 10.2.1 software to extract the study area from the selected bands of the Landsat satellite images. A widely used supervised image classification method was adopted for classifying the Landsat bands in this study32,34. The principle of operation of this method involves the identification of known sample training points which are then used to classify other unknown points with related spectral signatures35. The three monochromatic satellite bands were combined to produce the false colour composite (FCC) using the data management tool36. This involves drawing polygons on the LULC type to select the training points. The LULC types adopted for this study include urbanized areas, agricultural land, rocks, forests, bare surfaces, and water bodies. These were modified LULC types from IPOC Good Practice Guidance37. To achieve this, a minimum of 40 sample points were selected randomly for each category of LULC type36. Having prior knowledge of the study area assisted in the selection of the training points38.

The multivariate maximum likelihood classification (MLC) technique was used for transforming the images. Other image transformation techniques have been used by researchers. These include the fuzzy set classifier, neural networks (NN) classifier, extraction and classification of homogenous objects (ECHO) classifier, per-field classifier, sub-pixel classifier, decision trees (DTs), support vector machines (SVMs), minimum distance classifier (MDC) and so-on39. The adoption of any of these techniques is dependent on the knowledge of the area of study, band selection, accessibility of data, the complexity of the landscape, the classification algorithm, and the proficiency of the analyst39. We preferred MLC to other techniques in this study due to its reported high level of accuracy in tropical regions32,34. Another reason for choosing MLC is that it is readily incorporated in many widely used GIS software packages. This MLC algorithm operates based on assigning pixels to the highest probability class and establishing the class ownership of such pixels. It is also regarded as a parametric classifier whose data follows almost a normal distribution39. We ensured the accuracy of this classifier by assigning a large number of training sample points using our prior knowledge of the study area.

Description of the LULC categories

The urbanized area is the developed part of the study area. This includes houses, roads, railways, and industries. This is also known to be a settlement in other literature40. Agricultural land is the part of the study area dominated by agricultural activities and herbaceous plants and grasses. Agricultural land is generally a product of deforestation36. Rocks are part of the study area comprising solid mineral materials (rocks). Bare land is the bare soil which is either made open by natural or human activities.

Forests are parts of the study area dominated by trees. They can be primary or secondary forests depending on the rate of disturbances. According to41, forest land is an area having more than 0.5 ha of flora comprising trees (height is above 5 m) with a canopy greater than 10%. The forests in Penang are generally both primary and secondary42. Water bodies are parts of the study area covered by water seasonally or permanently. These include seas, rivers, lakes, ponds, streams, or reservoirs40.

Determination of change in the LULC

The rate and extent of change in the LULC of Penang within the periods under consideration were determined following the formula below43:

$$ {\text{Changed area }}\left( {{\text{C}}_{{\text{a}}} } \right) \, = {\text{ T}}_{{\text{a}}} \left( {\text{year 2}} \right) \, {-}{\text{ T}}_{{\text{a}}} \left( {\text{year 1}} \right) $$
(8)
$$ {\text{Changed extent }}\left( {{\text{C}}_{{\text{e}}} } \right) \, = {\text{ C}}_{{\text{a}}} /{\text{ T}}_{{\text{a}}} \left( {\text{year 1}} \right) $$
(9)
$$ {\text{Percentage of change }} = {\text{ C}}_{{\text{e}}} {\text{x 1}}00 $$
(10)

where Ta means the total area.

Determination of relationship between LST and NDVI

The values of LST and NDVI at 20 random points of each LULC class were used. The relationship between the LST and NDVI across all the LULC classes in each year was determined using the bivariate linear regression analysis. This was done in Paleontological Statistical (PAST) package 3.0.

Classification accuracy assessment

The classification accuracy was assessed by taking ground truth coordinate data of the LULC of the study area using a geographical positioning system (GPS) device (Garmin Etrex 10). These data were compared with the LULC classified in this study32. Consequently, an error matrix was generated. This normally uses ground truth data to explain the accuracy of the classified LULC. The error matrix comprises the user’s accuracy, the producer’s accuracy, overall accuracy and the Kappa index32.

The producer’s accuracy (omission error) represents the probability of the correctly classified reference pixel and it is determined using this formula below:

$${\text{Producer's accuracy }}\left( \% \right) \, = { 1}00\% \, - {\text{ error of omission}} $$
(11)

Also, the user’s accuracy (commission error) represents the probability that the classified pixel matches the one on the ground36 and it is determined using the formula below:

$$ {\text{User's accuracy }}\left( \% \right) \, = { 1}00\% \, - {\text{ error of commission}} $$
(12)

The statistical accuracy of the matrix was determined using the Kappa coefficient44. This Kappa coefficient ranges from − 1 to + 145. Therefore, the overall accuracy of the classification was determined by dividing the total number of correctly classified pixels by the total number of sampled ground truth data40.