Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????→??≪??ル?踰??誘り??κ?B 0011111100111111001111110011111110000001101010000011111100111111100000011110000100111111001111111000001110001011001111111110011011111010001111110011111110010111010101011000001011101000001111110011111110000011110010000011111101000010 3f3f3f3f81a83f3f81e13f3f838b3fe6fa3f3f975582e83f3f83c83f42
EUC-JP ???沅→??≪??ル?踰??誘り??κ?B 00111111001111110011111110001111110001101110100110100010101010100011111100111111101000101110001100111111001111111010010111101011001111111110110011111100001111110011111111001101101101101010010011101010001111110011111110100110110010100011111101000010 3f3f3f8fc6e9a2aa3f3fa2e33f3fa5eb3fecfc3f3fcdb6a4ea3f3fa6ca3f42
UTF-8 蓮잙슣沅→끽類≪떨曆ル뿭踰껃짃誘り텭若κ씨B 111011111010011010011001111011001001111010011001111011001000101010100011111001101011001010000101111000101000011010010010111010111000000110111101111011111010011110010000111000101000100110101010111010111001011010101000111011111010011010001011111000111000001110101011111010111011111110101101111010001011100010110000111010101011101110000011111011001010011110000011111010001010101010011000111000111000001010001010111011011000010110101101111011111010010110110100110011101011101011101100100101001010100001000010 efa699ec9e99ec8aa3e6b285e28692eb81bdefa790e289aaeb96a8efa68be383abebbfade8b8b0eabb83eca783e8aa98e3828aed85adefa5b4cebaec94a842
UHC 蓮잙슣沅→끽類≪떨曆ル뿭踰껃짃誘り텭若κ씨B 11100110111001011001111111101011100110101010111111101010101101101010000111100110101100111010001111101011101110101010000111101100101101101011001111100110101101111010101111101011100101111010110111101011101100101000001111100101101000111001001111101011101011111010101011101010101101101010000011100101101011101010010111101010101111101011111001000010 e6e59feb9aafeab6a1e6b3a3ebbaa1ecb6b3e6b7abeb97adebb283e5a393ebafaaeab6a0e5aea5eabebe42
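
Rows like the ones above can be approximated offline with a short Python sketch. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names encode_row and encode_table, which are hypothetical; the converter itself may use slightly different conversion tables. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the ISO-8859-1 row.

```python
# Assumed correspondence between the table's charset labels and Python codecs.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("B").items():
        print(label, bits, hex_string)  # every row ends in: 01000010 42
```

For instance, encode_row("→", "cp932") yields the hex string 81a8 and encode_row("→", "euc_jp") yields a2aa, the same byte pairs that appear for "→" in the SJIS-WIN and EUC-JP rows.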

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).