What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 訂蛟??彫??蛟??諍?訂蛟??漬??蛟???? 100100101111100111100101100000000011111100111111100100101010010000111111001111111110010110000000001111110011111111100110011110010011111110010010111110011110010110000000001111110011111110010010110100000011111100111111111001011000000000111111001111110011111100111111 92f9e5803f3f92a43f3fe5803f3fe6793f92f9e5803f3f92d03f3fe5803f3f3f3f
EUC-JP 訂蛟??彫?邕蛟??諍?訂蛟??漬?邕蛟???? 11000100111110111110100111100000001111110011111111000100101001100011111110001111111000011110110111101001111000000011111100111111111010111101101000111111110001001111101111101001111000000011111100111111110001001101001000111111100011111110000111101101111010011110000000111111001111110011111100111111 c4fbe9e03f3fc4a63f8fe1ede9e03f3febda3fc4fbe9e03f3fc4d23f8fe1ede9e03f3f3f3f
UTF-8 訂蛟렰렪彫렞邕蛟렰렧諍렮訂蛟렰렪漬렓邕蛟렰렧卽렗 111010001010100010000010111010001001101110011111111010111010000010110000111010111010000010101010111001011011110110101011111010111010000010011110111010011000001010010101111010001001101110011111111010111010000010110000111010111010000010100111111010001010101110001101111010111010000010101110111010001010100010000010111010001001101110011111111010111010000010110000111010111010000010101010111001101011110010101100111010111010000010010011111010011000001010010101111010001001101110011111111010111010000010110000111010111010000010100111111001011000110110111101111010111010000010010111 e8a882e89b9feba0b0eba0aae5bdabeba09ee98295e89b9feba0b0eba0a7e8ab8deba0aee8a882e89b9feba0b0eba0aae6bcaceba093e98295e89b9feba0b0eba0a7e58dbdeba097
UHC 訂蛟렰렪彫렞邕蛟렰렧諍렮訂蛟렰렪漬렓邕蛟렰렧卽렗 111011111111010011001110111100011000111010111101100011101011100011110000110000011000111010101111111010001011101111001110111100011000111010111101100011101011011011101110101101011000111010111011111011111111010011001110111100011000111010111101100011101011100011110010101100001000111010101000111010001011101111001110111100011000111010111101100011101011011011110001111011011000111010101100 eff4cef18ebd8eb8f0c18eafe8bbcef18ebd8eb6eeb58ebbeff4cef18ebd8eb8f2b08ea8e8bbcef18ebd8eb6f1ed8eac

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
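
The conversion shown in the table above can be reproduced roughly as follows. This is only a minimal sketch in Python, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed equivalents in Python's standard library, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte 3f), which would explain the runs of 3f in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in the given codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

Using `errors="replace"` keeps the output aligned with the table's behavior of showing `?` for unrepresentable characters; with the default `errors="strict"`, Python would raise `UnicodeEncodeError` instead.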