To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??U?????C??U?????CB 00111111001111110101010100111111001111110011111100111111001111110100001100111111001111110101010100111111001111110011111100111111001111110100001101000010 3f3f553f3f3f3f3f433f3f553f3f3f3f3f4342
SJIS-WIN ヘーUモ」モ」CヘーUモ」モ」CB 110011011011000001010101110100111010001111110000111000111101001110100011010000111100110110110000010101011101001110100011111100001110001111010011101000110100001101000010 cdb055d3a3f0e3d3a343cdb055d3a3f0e3d3a34342
EUC-JP ヘーUモ」?モ」CヘーUモ」?モ」CB 10001110110011011000111010110000010101011000111011010011100011101010001100111111100011101101001110001110101000110100001110001110110011011000111010110000010101011000111011010011100011101010001100111111100011101101001110001110101000110100001101000010 8ecd8eb0558ed38ea33f8ed38ea3438ecd8eb0558ed38ea33f8ed38ea34342
UTF-8 ヘーUモ」モ」CヘーUモ」モ」CB 1110111110111110100011011110111110111101101100000101010111101111101111101001001111101111101111011010001111101110100000101010001011101111101111101001001111101111101111011010001101000011111011111011111010001101111011111011110110110000010101011110111110111110100100111110111110111101101000111110111010000010101000101110111110111110100100111110111110111101101000110100001101000010 efbe8defbdb055efbe93efbda3ee82a2efbe93efbda343efbe8defbdb055efbe93efbda3ee82a2efbe93efbda34342
UHC ??U?????C??U?????CB 00111111001111110101010100111111001111110011111100111111001111110100001100111111001111110101010100111111001111110011111100111111001111110100001101000010 3f3f553f3f3f3f3f433f3f553f3f3f3f3f4342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)