To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 夭c?旬??飮?? 10011010111011101000001010000011001111111000111101111011001111110011111110011111010110100011111100111111 9aee82833f8f7b3f3f9f5a3f3f
EUC-JP 夭c?旬??飮?? 11010100111100001010001111100011001111111011110111011100001111110011111111011101101110110011111100111111 d4f0a3e33fbddc3f3fddbb3f3f
UTF-8 夭c냻旬룟냶飮귥뿿 111001011010010010101101111011111011110110000011111010111000001110111011111001101001011110101100111010111010001110011111111010111000001110110110111010011010001110101110111010101011011110100101111010111011111110111111 e5a4adefbd83eb83bbe697aceba39feb83b6e9a3aeeab7a5ebbfbf
UHC 夭c냻旬룟냶飮귥뿿 111010001110110010100011111000111000011010001011111000101110001010110111111001011000011010000110111010111110011010000010111011001001011110111111 e8eca3e3868be2e2b7e58686ebe682ec97bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)