To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????K}?????????K{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100101101111101001111110011111100111111001111110011111100111111001111110011111100111111010010110111101101011110 3f3f3f3f3f3f3f3f3f4b7d3f3f3f3f3f3f3f3f3f4b7b5e
SJIS-WIN 瘟??筌??瘟??K}瘟??筌??瘟??K{^ 1110000110001001001111110011111111100010101000110011111100111111111000011000100100111111001111110100101101111101111000011000100100111111001111111110001010100011001111110011111111100001100010010011111100111111010010110111101101011110 e1893f3fe2a33f3fe1893f3f4b7de1893f3fe2a33f3fe1893f3f4b7b5e
EUC-JP 瘟??筌??瘟??K}瘟??筌??瘟??K{^ 1110000111101001001111110011111111100100101001010011111100111111111000011110100100111111001111110100101101111101111000011110100100111111001111111110010010100101001111110011111111100001111010010011111100111111010010110111101101011110 e1e93f3fe4a53f3fe1e93f3f4b7de1e93f3fe4a53f3fe1e93f3f4b7b5e
UTF-8 瘟욕즯筌껃땭瘟욜찂K}瘟욕즯筌껃땭瘟욜찂K{^ 1110011110011000100111111110110010011010100101011110110010100110101011111110011110101101100011001110101010111011100000111110101110010101101011011110011110011000100111111110110010011010100111001110110010110000100000100100101101111101111001111001100010011111111011001001101010010101111011001010011010101111111001111010110110001100111010101011101110000011111010111001010110101101111001111001100010011111111011001001101010011100111011001011000010000010010010110111101101011110 e7989fec9a95eca6afe7ad8ceabb83eb95ade7989fec9a9cecb0824b7de7989fec9a95eca6afe7ad8ceabb83eb95ade7989fec9a9cecb0824b7b5e
UHC 瘟욕즯筌껃땭瘟욜찂K}瘟욕즯筌껃땭瘟욜찂K{^ 1110100010110000101111111110010110100011100000011110111110100111100000111110010110001011100000111110100010110000101111111110011110101001100001100100101101111101111010001011000010111111111001011010001110000001111011111010011110000011111001011000101110000011111010001011000010111111111001111010100110000110010010110111101101011110 e8b0bfe5a381efa783e58b83e8b0bfe7a9864b7de8b0bfe5a381efa783e58b83e8b0bfe7a9864b7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)