What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 耶??猷??宋??俉?????魏??耶??毅? 100101101110101100111111001111111001011101010001001111110011111110010001011101100011111100111111111110100110000100111111001111110011111100111111001111111110100110110000001111110011111110010110111010110011111100111111100010110100001000111111 96eb3f3f97513f3f91763f3ffa613f3f3f3f3fe9b03f3f96eb3f3f8b423f
EUC-JP 耶??猷??宋??俉?????魏??耶??毅? 11001100111011010011111100111111110011011011001000111111001111111100000111010111001111110011111110001111101100011011101100111111001111110011111100111111001111111111001010110010001111110011111111001100111011010011111100111111101101011010001100111111 cced3f3fcdb23f3fc1d73f3f8fb1bb3f3f3f3f3ff2b23f3fcced3f3fb5a33f
UTF-8 耶쇰씈猷녷끽宋믩쨨俉묒궏琉녽레魏녿뮚耶쇰똻毅륛 111010001000000010110110111011001000011110110000111011001001010010001000111001111000110010110111111010111000010110110111111010111000000110111101111001011010111010001011111010111010111110101001111011001010100010101000111001001011111110001001111010111010110010010010111010101011011010001111111011111010011110001100111010111000010110111101111010111010000010001000111010011010110110001111111010111000010110111111111010111010111010011010111010001000000010110110111011001000011110110000111010111001100010111011111001101010111110000101111010111010010110011011 e880b6ec87b0ec9488e78cb7eb85b7eb81bde5ae8bebafa9eca8a8e4bf89ebac92eab68fefa78ceb85bdeba088e9ad8feb85bfebae9ae880b6ec87b0eb98bbe6af85eba59b
UHC 耶쇰씈猷녷끽宋믩쨨俉묒궏琉녽레魏녿뮚耶쇰똻毅륛 11100101101011011011110011101011100111011010000011101011101000111000011011100110101100111010001111100001111001001001001011101011101001001000001111100111111010111001000111101100100000101010010111101011101001001000011011101001101101111011100111101010111000001000011011101011100100101010011011100101101011011011110011101011100011001000000111101011111101101001000001000010 e5adbceb9da0eba386e6b3a3e1e492eba483e7eb91ec82a5eba486e9b7b9eae086eb92a6e5adbceb8c81ebf69042

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
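
The conversion in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The codec names are assumptions about which Python codecs match the charset labels above (cp932 for SJIS-WIN, cp949 for UHC), and the helper names encode_row and encode_table are made up for illustration. Unencodable characters are replaced with "?" (3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("A").items():
        print(label, bits, hex_string)
```

For example, encode_row("A", "utf-8") gives ("01000001", "41"), because "A" is the single byte 0x41 in every charset listed here.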