What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 閼ア螂ェ螟夂矯譚溷援閼ア螂ェ螟夂矯譚溷援B 11101000100001001011000111100101101001011010101011100101101001001001101011100111100010111011100011100110100111011001111111100101100010011000011111101000100001001011000111100101101001011010101011100101101001001001101011100111100010111011100011100110100111011001111111100101100010011000011101000010 e884b1e5a5aae5a49ae78bb8e69d9fe58987e884b1e5a5aae5a49ae78bb8e69d9fe5898742
EUC-JP 閼ア螂ェ螟夂矯譚溷援閼ア螂ェ螟夂矯譚溷援B 1110111111100100100011101011000111101010101001111000111010101010111010101010011011010100111010011011011010111010111010111111110111011110111001111011000111100111111011111110010010001110101100011110101010100111100011101010101011101010101001101101010011101001101101101011101011101011111111011101111011100111101100011110011101000010 efe48eb1eaa78eaaeaa6d4e9b6baebfddee7b1e7efe48eb1eaa78eaaeaa6d4e9b6baebfddee7b1e742
UTF-8 閼ア螂ェ螟夂矯譚溷援閼ア螂ェ螟夂矯譚溷援B 11101001100101101011110011101111101111011011000111101000100111101000001011101111101111011010101011101000100111101001111111100101101001001000001011100111100111111010111111101000101011011001101011100110101110101011011111100110100011111011010011101001100101101011110011101111101111011011000111101000100111101000001011101111101111011010101011101000100111101001111111100101101001001000001011100111100111111010111111101000101011011001101011100110101110101011011111100110100011111011010001000010 e996bcefbdb1e89e82efbdaae89e9fe5a482e79fafe8ad9ae6bab7e68fb4e996bcefbdb1e89e82efbdaae89e9fe5a482e79fafe8ad9ae6bab7e68fb442
UHC 閼?螂?螟?矯譚?援閼?螂?螟?矯譚?援B 111001001101100100111111110101011100110000111111110110011010110100111111110011101110110011010011110010010011111111101010101101011110010011011001001111111101010111001100001111111101100110101101001111111100111011101100110100111100100100111111111010101011010101000010 e4d93fd5cc3fd9ad3fceecd3c93feab5e4d93fd5cc3fd9ad3fceecd3c93feab542

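SJIS-Win, EUC-JP: Legacy character encodings used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is encoded as "?" (0x3f), as in the ISO-8859-1 and UHC rows.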
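
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and `encode_table` is a hypothetical helper name. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(format(byte, "08b") for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hexstr in encode_table("B"):
    print(label, bits, hexstr)
```

For the ASCII letter "B", every charset listed here yields the single byte 0x42 (binary 01000010), which is why each row in the table ends in `42`.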