To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??}v??}vB 001111110011111101111101011101100011111100111111011111010111011001000010 3f3f7d763f3f7d7642
SJIS-WIN 鞨皆}v鞨皆}vB 11101000111000001000101001000110011111010111011011101000111000001000101001000110011111010111011001000010 e8e08a467d76e8e08a467d7642
EUC-JP 鞨皆}v鞨皆}vB 11110000111000101011001110100111011111010111011011110000111000101011001110100111011111010111011001000010 f0e2b3a77d76f0e2b3a77d7642
UTF-8 鞨皆}v鞨皆}vB 1110100110011110101010001110011110011010100001100111110101110110111010011001111010101000111001111001101010000110011111010111011001000010 e99ea8e79a867d76e99ea8e79a867d7642
UHC 鞨皆}v鞨皆}vB 11001010111010101100101111001011011111010111011011001010111010101100101111001011011111010111011001000010 caeacbcb7d76caeacbcb7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)