To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 艶??蟻??艶??蟻??艶??蟻??艶??蟻??^ 100010011001000000111111001111111000101101100001001111110011111110001001100100000011111100111111100010110110000100111111001111111000100110010000001111110011111110001011011000010011111100111111100010011001000000111111001111111000101101100001001111110011111101011110 89903f3f8b613f3f89903f3f8b613f3f89903f3f8b613f3f89903f3f8b613f3f5e
EUC-JP 艶??蟻??艶??蟻??艶??蟻??艶??蟻??^ 101100011111000000111111001111111011010111000010001111110011111110110001111100000011111100111111101101011100001000111111001111111011000111110000001111110011111110110101110000100011111100111111101100011111000000111111001111111011010111000010001111110011111101011110 b1f03f3fb5c23f3fb1f03f3fb5c23f3fb1f03f3fb5c23f3fb1f03f3fb5c23f3f5e
UTF-8 艶녘떏蟻긷젽艶녘떏蟻깆꼵艶녘떏蟻긷젽艶녘떏蟻깆꼵^ 11101000100010011011011011101011100001011001100011101011100101101000111111101000100111111011101111101010101110001011011111101100101000001011110111101000100010011011011011101011100001011001100011101011100101101000111111101000100111111011101111101010101110011000011011101010101111001011010111101000100010011011011011101011100001011001100011101011100101101000111111101000100111111011101111101010101110001011011111101100101000001011110111101000100010011011011011101011100001011001100011101011100101101000111111101000100111111011101111101010101110011000011011101010101111001011010101011110 e889b6eb8598eb968fe89fbbeab8b7eca0bde889b6eb8598eb968fe89fbbeab986eabcb5e889b6eb8598eb968fe89fbbeab8b7eca0bde889b6eb8598eb968fe89fbbeab986eabcb55e
UHC 艶녘떏蟻긷젽艶녘떏蟻깆꼵艶녘떏蟻긷젽艶녘떏蟻깆꼵^ 11100110111111011011001111101000100010111010010111101011111111001011000111100101101000001010111111100110111111011011001111101000100010111010010111101011111111001011000111101100100001001000110111100110111111011011001111101000100010111010010111101011111111001011000111100101101000001010111111100110111111011011001111101000100010111010010111101011111111001011000111101100100001001000110101011110 e6fdb3e88ba5ebfcb1e5a0afe6fdb3e88ba5ebfcb1ec848de6fdb3e88ba5ebfcb1e5a0afe6fdb3e88ba5ebfcb1ec848d5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)