To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 咫孟??贈????烽緋肢?贈????六 100110100100000010010110110100000011111100111111100100011010000100111111001111110011111100111111111000001000001010010100111010101000111010001000001111111001000110100001001111110011111100111111001111111001100001011010 9a4096d03f3f91a13f3f3f3fe08294ea8e883f91a13f3f3f3f985a
EUC-JP 咫孟??贈????烽緋肢?贈????六 110100111010000111001100110100100011111100111111110000101010001100111111001111110011111100111111110111111110001011001000111011001011101111101000001111111100001010100011001111110011111100111111001111111100111110111011 d3a1ccd23f3fc2a33f3f3f3fdfe2c8ecbbe83fc2a33f3f3f3fcfbb
UTF-8 咫孟렣렖贈얹렱폈렱烽緋肢렖贈얹렱폈렱六 111001011001001010101011111001011010110110011111111010111010000010100011111010111010000010010110111010001011010010001000111011001001011010111001111010111010000010110001111011011000111110001000111010111010000010110001111001111000001110111101111001111011011110001011111010001000001010100010111010111010000010010110111010001011010010001000111011001001011010111001111010111010000010110001111011011000111110001000111010111010000010110001111001011000010110101101 e592abe5ad9feba0a3eba096e8b488ec96b9eba0b1ed8f88eba0b1e783bde7b78be882a2eba096e8b488ec96b9eba0b1ed8f88eba0b1e585ad
UHC 咫孟렣렖贈얹렱폈렱烽緋肢렖贈얹렱폈렱六 1111001010100001110110001110101110001110101101001000111010101011111100011111110010111110111100011000111010111110110001101111000110001110101111101101110011101011110111011111110011110010101101101000111010101011111100011111110010111110111100011000111010111110110001101111000110001110101111101101011110111111 f2a1d8eb8eb48eabf1fcbef18ebec6f18ebedcebddfcf2b68eabf1fcbef18ebec6f18ebed7bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)