What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???nkf???nk^}Y???nkf???nk^}bE 0011111100111111001111110110111001101011011001100011111100111111001111110110111001101011010111100111110101011001001111110011111100111111011011100110101101100110001111110011111100111111011011100110101101011110011111010110001001000101 3f3f3f6e6b663f3f3f6e6b5e7d593f3f3f6e6b663f3f3f6e6b5e7d6245
SJIS-WIN 彫頭恪nkf彫頭恪nk^}Y彫頭恪nkf彫頭恪nk^}bE 1001001010100100100100111010101010011100100011010110111001101011011001101001001010100100100100111010101010011100100011010110111001101011010111100111110101011001100100101010010010010011101010101001110010001101011011100110101101100110100100101010010010010011101010101001110010001101011011100110101101011110011111010110001001000101 92a493aa9c8d6e6b6692a493aa9c8d6e6b5e7d5992a493aa9c8d6e6b6692a493aa9c8d6e6b5e7d6245
EUC-JP 彫頭恪nkf彫頭恪nk^}Y彫頭恪nkf彫頭恪nk^}bE 1100010010100110110001101010110011010111111011010110111001101011011001101100010010100110110001101010110011010111111011010110111001101011010111100111110101011001110001001010011011000110101011001101011111101101011011100110101101100110110001001010011011000110101011001101011111101101011011100110101101011110011111010110001001000101 c4a6c6acd7ed6e6b66c4a6c6acd7ed6e6b5e7d59c4a6c6acd7ed6e6b66c4a6c6acd7ed6e6b5e7d6245
UTF-8 彫頭恪nkf彫頭恪nk^}Y彫頭恪nkf彫頭恪nk^}bE 1110010110111101101010111110100110100000101011011110011010000001101010100110111001101011011001101110010110111101101010111110100110100000101011011110011010000001101010100110111001101011010111100111110101011001111001011011110110101011111010011010000010101101111001101000000110101010011011100110101101100110111001011011110110101011111010011010000010101101111001101000000110101010011011100110101101011110011111010110001001000101 e5bdabe9a0ade681aa6e6b66e5bdabe9a0ade681aa6e6b5e7d59e5bdabe9a0ade681aa6e6b66e5bdabe9a0ade681aa6e6b5e7d6245
UHC 彫頭恪nkf彫頭恪nk^}Y彫頭恪nkf彫頭恪nk^}bE 1111000011000001110101001110100111001010110000010110111001101011011001101111000011000001110101001110100111001010110000010110111001101011010111100111110101011001111100001100000111010100111010011100101011000001011011100110101101100110111100001100000111010100111010011100101011000001011011100110101101011110011111010110001001000101 f0c1d4e9cac16e6b66f0c1d4e9cac16e6b5e7d59f0c1d4e9cac16e6b66f0c1d4e9cac16e6b5e7d6245

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
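
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the function name `encode_to_bits` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?", which mirrors the "???" in the ISO-8859-1 row.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption made for illustration.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, charset):
    """Return (binary string, hex string) for text encoded in charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(CODECS[charset], errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    for name in CODECS:
        binary, hexadecimal = encode_to_bits("彫nkf", name)
        print(name, binary, hexadecimal)
```

Calling `encode_to_bits` with a longer input, such as the string used in the table, prints correspondingly longer bit strings. ASCII characters like "nkf" encode to the same bytes in all five charsets, so only the bytes for the kanji differ between rows.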