To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??Jkf??Jk^}Y??Jkf??Jk^}bE 00111111001111110100101001101011011001100011111100111111010010100110101101011110011111010101100100111111001111110100101001101011011001100011111100111111010010100110101101011110011111010110001001000101 3f3f4a6b663f3f4a6b5e7d593f3f4a6b663f3f4a6b5e7d6245
SJIS-WIN 叩巽Jkf叩巽Jk^}Y叩巽Jkf叩巽Jk^}bE 100100100100000010010010010001100100101001101011011001101001001001000000100100100100011001001010011010110101111001111101010110011001001001000000100100100100011001001010011010110110011010010010010000001001001001000110010010100110101101011110011111010110001001000101 924092464a6b66924092464a6b5e7d59924092464a6b66924092464a6b5e7d6245
EUC-JP 叩巽Jkf叩巽Jk^}Y叩巽Jkf叩巽Jk^}bE 110000111010000111000011101001110100101001101011011001101100001110100001110000111010011101001010011010110101111001111101010110011100001110100001110000111010011101001010011010110110011011000011101000011100001110100111010010100110101101011110011111010110001001000101 c3a1c3a74a6b66c3a1c3a74a6b5e7d59c3a1c3a74a6b66c3a1c3a74a6b5e7d6245
UTF-8 叩巽Jkf叩巽Jk^}Y叩巽Jkf叩巽Jk^}bE 1110010110001111101010011110010110110111101111010100101001101011011001101110010110001111101010011110010110110111101111010100101001101011010111100111110101011001111001011000111110101001111001011011011110111101010010100110101101100110111001011000111110101001111001011011011110111101010010100110101101011110011111010110001001000101 e58fa9e5b7bd4a6b66e58fa9e5b7bd4a6b5e7d59e58fa9e5b7bd4a6b66e58fa9e5b7bd4a6b5e7d6245
UHC 叩巽Jkf叩巽Jk^}Y叩巽Jkf叩巽Jk^}bE 110011011011000011100001110111100100101001101011011001101100110110110000111000011101111001001010011010110101111001111101010110011100110110110000111000011101111001001010011010110110011011001101101100001110000111011110010010100110101101011110011111010110001001000101 cdb0e1de4a6b66cdb0e1de4a6b5e7d59cdb0e1de4a6b66cdb0e1de4a6b5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)