To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 額?????獄??湲 10001010011110100011111100111111001111110011111100111111100011011001011000111111001111111001111111010001 8a7a3f3f3f3f3f8d963f3f9fd1
EUC-JP 額?????獄??湲 10110011110110110011111100111111001111110011111100111111101110011111011000111111001111111101111011010011 b3db3f3f3f3f3fb9f63f3fded3
UTF-8 額ㅻ퉬溜곕젒獄몄뇿湲 111010011010000110001101111000111000010110111011111011011000100110101100111011111010011110001011111010101011001110010101111011001010000010010010111001111000110110000100111010111010101010000100111010111000011110111111111001101011100110110010 e9a18de385bbed89acefa78beab395eca092e78d84ebaa84eb87bfe6b9b2
UHC 額ㅻ퉬溜곕젒獄몄뇿湲 1110010011111110101001001110101110111001100001001110101011111110101100001110101110100000100100011110100010101011101110001110110010000111101000001110101010111000 e4fea4ebb984eafeb0eba091e8abb8ec87a0eab8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)