To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??伊屯?弔?衣?∧ 001111110011111110001000110010011001001111010100001111111001001010100010001111111000100011011111001111111000000111001000 3f3f88c993d43f92a23f88df3f81c8
EUC-JP ??伊屯?弔?衣?∧ 001111110011111110110000110010111100011011010110001111111100010010100100001111111011000011100001001111111010001011001010 3f3fb0cbc6d63fc4a43fb0e13fa2ca
UTF-8 欌렪伊屯횅弔렲衣쯤∧ 111001101010110010001100111010111010000010101010111001001011110010001010111001011011000110101111111011011001101010000101111001011011110010010100111010111010000010110010111010001010000110100011111011001010111110100100111000101000100010100111 e6ac8ceba0aae4bc8ae5b1afed9a85e5bc94eba0b2e8a1a3ecafa4e288a7
UHC 欌렪伊屯횅弔렲衣쯤∧ 1110110111101011100011101011100011101100101001011101010011101010110010001011011111110000110000001000111010111111111010111111110111000010111010111010000111111100 edeb8eb8eca5d4eac8b7f0c08ebfebfdc2eba1fc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)