What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????[????????[^ 00111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 短造探巽短造探巽[短造探巽短造探巽[^ 1001001001011010100100011010001010010010010101001001001001000110100100100101101010010001101000101001001001010100100100100100011001011011100100100101101010010001101000101001001001010100100100100100011010010010010110101001000110100010100100100101010010010010010001100101101101011110 925a91a292549246925a91a2925492465b925a91a292549246925a91a2925492465b5e
EUC-JP 短造探巽短造探巽[短造探巽短造探巽[^ 1100001110111011110000101010010011000011101101011100001110100111110000111011101111000010101001001100001110110101110000111010011101011011110000111011101111000010101001001100001110110101110000111010011111000011101110111100001010100100110000111011010111000011101001110101101101011110 c3bbc2a4c3b5c3a7c3bbc2a4c3b5c3a75bc3bbc2a4c3b5c3a7c3bbc2a4c3b5c3a75b5e
UTF-8 短造探巽短造探巽[短造探巽短造探巽[^ 111001111001111110101101111010011000000010100000111001101000111010100010111001011011011110111101111001111001111110101101111010011000000010100000111001101000111010100010111001011011011110111101010110111110011110011111101011011110100110000000101000001110011010001110101000101110010110110111101111011110011110011111101011011110100110000000101000001110011010001110101000101110010110110111101111010101101101011110 e79fade980a0e68ea2e5b7bde79fade980a0e68ea2e5b7bd5be79fade980a0e68ea2e5b7bde79fade980a0e68ea2e5b7bd5b5e
UHC 短造探巽短造探巽[短造探巽短造探巽[^ 1101001110101101111100001110001111110111101011101110000111011110110100111010110111110000111000111111011110101110111000011101111001011011110100111010110111110000111000111111011110101110111000011101111011010011101011011111000011100011111101111010111011100001110111100101101101011110 d3adf0e3f7aee1ded3adf0e3f7aee1de5bd3adf0e3f7aee1ded3adf0e3f7aee1de5b5e
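
The conversion in the table can be approximated with a short Python sketch. The codec names used here (`cp932` for SJIS-WIN, `cp949` for UHC) are an assumed mapping onto Python's built-in codecs, not something this page specifies, and `encode_bits` is a hypothetical helper name. Using `errors="replace"` is likewise an assumption chosen to mimic the `?` (0x3f) substitution seen in the ISO-8859-1 row. Results for unusual characters may differ slightly between implementations.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot represent.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(format(byte, "08b") for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hexadecimal bit string.
    for label, codec in CODECS.items():
        binary, hexadecimal = encode_bits("短[^", codec)
        print(label, binary, hexadecimal)
```

For example, `encode_bits("短", "utf-8")` yields the three bytes `e79fad`, which matches the start of the UTF-8 row above.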

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).