To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 褥?∥宜??亦??違 111001011111000100111111100000010110000110001011010110000011111100111111100101101001001000111111001111111000100011100001 e5f13f81618b583f3f96923f3f88e1
EUC-JP 褥?‖宜??亦??違 111010101111001100111111101000011100001010110101101110010011111100111111110010111111001000111111001111111011000011100011 eaf33fa1c2b5b93f3fcbf23f3fb0e3
UTF-8 褥띕∥宜뱁걫亦껋눛違 111010001010010010100101111010111001110110010101111000101000100010100101111001011010111010011100111010111011000110000001111010101011000110101011111001001011101010100110111010101011101110001011111010111000100010011011111010011000000110010101 e8a4a5eb9d95e288a5e5ae9cebb181eab1abe4baa6eabb8beb889be98195
UHC 褥띕∥宜뱁걫亦껋눛違 1110100110110011101101101110101110100001101010111110101111110001101110011110110110000001100101001110011010110010100000111110110010000111101100111110101011011110 e9b3b6eba1abebf1b9ed8194e6b283ec87b3eade

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)