To what bit string is a character (or a short string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 歪??謠①?堯??俉??歪??謠①?堯??俉??^ 1001100001100011001111110011111111100110100011111000011101000000001111111110101010011111001111110011111111111010011000010011111100111111100110000110001100111111001111111110011010001111100001110100000000111111111010101001111100111111001111111111101001100001001111110011111101011110 98633f3fe68f87403fea9f3f3ffa613f3f98633f3fe68f87403fea9f3f3ffa613f3f5e
EUC-JP 歪??謠??堯??俉??歪??謠??堯??俉??^ 1100111111000100001111110011111111101011111011110011111100111111111101001010000100111111001111111000111110110001101110110011111100111111110011111100010000111111001111111110101111101111001111110011111111110100101000010011111100111111100011111011000110111011001111110011111101011110 cfc43f3febef3f3ff4a13f3f8fb1bb3f3fcfc43f3febef3f3ff4a13f3f8fb1bb3f3f5e
UTF-8 歪득찀謠①맏堯뗣띂俉듐깯歪득찀謠①맏堯뗣띂俉듐깭^ 11100110101011011010101011101011100100111001110111101100101100001000000011101000101011001010000011100010100100011010000011101011101001111000111111100101101000001010111111101011100101111010001111101011100111011000001011100100101111111000100111101011100100111001000011101010101110011010111111100110101011011010101011101011100100111001110111101100101100001000000011101000101011001010000011100010100100011010000011101011101001111000111111100101101000001010111111101011100101111010001111101011100111011000001011100100101111111000100111101011100100111001000011101010101110011010110101011110 e6adaaeb939decb080e8aca0e291a0eba78fe5a0afeb97a3eb9d82e4bf89eb9390eab9afe6adaaeb939decb080e8aca0e291a0eba78fe5a0afeb97a3eb9d82e4bf89eb9390eab9ad5e
UHC 歪득찀謠①맏堯뗣띂俉듐깯歪득찀謠①맏堯뗣띂俉듐깭^ 11101000111000001011010111100110101010011000010011101001101010101010100011100111101110001011101011101000111010111000101111100011100011011011110111100111111010111011010111100011100000111001111011101000111000001011010111100110101010011000010011101001101010101010100011100111101110001011101011101000111010111000101111100011100011011011110111100111111010111011010111100011100000111001110001011110 e8e0b5e6a984e9aaa8e7b8bae8eb8be38dbde7ebb5e3839ee8e0b5e6a984e9aaa8e7b8bae8eb8be38dbde7ebb5e3839c5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
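
The conversion shown in the table can be approximated offline with a minimal Python sketch. It is not the code behind this page. The helper names (`CHARSETS`, `bit_strings`, `convert`) are hypothetical, and the mapping of the table labels to Python codec names is an assumption: SJIS-WIN is taken as `cp932` (as the note above states) and UHC as `cp949`. The sketch also assumes that characters a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the 3f bytes in the table.

```python
# Table label -> Python codec name (assumed mapping, see the note above).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("^").items():
        print(label, binary, hexadecimal)
```

For the single character "^", every charset listed here yields the same byte, 5e (01011110), which matches the last byte of each row in the table.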