To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????h???????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????????????h???????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ????????????h???????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 챔청혯챔짤혨챗첸째챘혻혧h챔청혯챔짤혨챗첸째챘혻혧 11101100101100011001010011101100101100101010110111101101100110001010111111101100101100011001010011101100101001111010010011101101100110001010100011101100101100011001011111101100101100101011100011101100101001111011100011101100101100011001100011101101100110001011101111101101100110001010011101101000111011001011000110010100111011001011001010101101111011011001100010101111111011001011000110010100111011001010011110100100111011011001100010101000111011001011000110010111111011001011001010111000111011001010011110111000111011001011000110011000111011011001100010111011111011011001100010100111 ecb194ecb2aded98afecb194eca7a4ed98a8ecb197ecb2b8eca7b8ecb198ed98bbed98a768ecb194ecb2aded98afecb194eca7a4ed98a8ecb197ecb2b8eca7b8ecb198ed98bbed98a7
UHC 챔청혯챔짤혨챗첸째챘혻혧h챔청혯챔짤혨챗첸째챘혻혧 11000011101010001100001110111011110000101001011011000011101010001100001010101001110000101001000011000011101010101100001110111110110000101011000011000011101010111100001010100000110000101000111101101000110000111010100011000011101110111100001010010110110000111010100011000010101010011100001010010000110000111010101011000011101111101100001010110000110000111010101111000010101000001100001010001111 c3a8c3bbc296c3a8c2a9c290c3aac3bec2b0c3abc2a0c28f68c3a8c3bbc296c3a8c2a9c290c3aac3bec2b0c3abc2a0c28f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)