To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????h 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN ??????????????似?????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111000111010010111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f8e973f3f3f3f3f68
EUC-JP ??????????????似?????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111011101111110111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3fbbf73f3f3f3f3f68
UTF-8 렺렱렻꼼씽렑렻렞렺셍렒렺렱렻似셔후렯렺렜h 11101011101000001011101011101011101000001011000111101011101000001011101111101010101111001011110011101100100101001011110111101011101000001001000111101011101000001011101111101011101000001001111011101011101000001011101011101100100001011000110111101011101000001001001011101011101000001011101011101011101000001011000111101011101000001011101111100100101111001011110011101100100001011001010011101101100110111000010011101011101000001010111111101011101000001011101011101011101000001001110001101000 eba0baeba0b1eba0bbeabcbcec94bdeba091eba0bbeba09eeba0baec858deba092eba0baeba0b1eba0bbe4bcbcec8594ed9b84eba0afeba0baeba09c68
UHC 렺렱렻꼼씽렑렻렞렺셍렒렺렱렻似셔후렯렺렜h 1000111011000010100011101011111010001110110000111011001011000100101111101100010110001110101001101000111011000011100011101010111110001110110000101011110011000100100011101010011110001110110000101000111010111110100011101100001111011110110001001011110011000101110010001100010010001110101111001000111011000010100011101010111001101000 8ec28ebe8ec3b2c4bec58ea68ec38eaf8ec2bcc48ea78ec28ebe8ec3dec4bcc5c8c48ebc8ec28eae68
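A conversion like the one tabulated above can be approximated in a few lines of Python. This is only a sketch, not the code behind this page. The mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, and `CODECS`, `encode_row`, and `encode_table` are hypothetical names introduced for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# cp932 and cp949 are Python's closest equivalents of SJIS-WIN and UHC;
# they may differ from the page's converter in a few edge-case mappings.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("似h").items():
        print(label, bits, hexstr)
```

For the input `似h`, the hexadecimal column from this sketch should agree with the corresponding bytes in the table: `3f68` for ISO-8859-1, `8e9768` for SJIS-WIN, `bbf768` for EUC-JP, and `e4bcbc68` for UTF-8.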

SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).