To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?東慂午雰嶝?橋括?東慂午雰嶝?橋跨^ 001111111001001110001100100111001100100010001100110111111001010110110101100110111101000100111111100010111011010010001010100001110011111110010011100011001001110011001000100011001101111110010101101101011001101111010001001111111000101110110100100011001101011101011110 3f938c9cc88cdf95b59bd13f8bb48a873f938c9cc88cdf95b59bd13f8bb48cd75e
EUC-JP ?東慂午雰嶝?橋括?東慂午雰嶝?橋跨^ 001111111100010111101100110110001100101010111000111000011100101010110111110101101101001100111111101101101011011010110011111001110011111111000101111011001101100011001010101110001110000111001010101101111101011011010011001111111011011010110110101110001101100101011110 3fc5ecd8cab8e1cab7d6d33fb6b6b3e73fc5ecd8cab8e1cab7d6d33fb6b6b8d95e
UTF-8 뤯東慂午雰嶝렱橋括뤯東慂午雰嶝렱橋跨^ 11101011101001001010111111100110100111011011000111100110100001011000001011100101100011011000100011101001100110111011000011100101101101101001110111101011101000001011000111100110101010011000101111100110100010111010110011101011101001001010111111100110100111011011000111100110100001011000001011100101100011011000100011101001100110111011000011100101101101101001110111101011101000001011000111100110101010011000101111101000101101111010100001011110 eba4afe69db1e68582e58d88e99bb0e5b69deba0b1e6a98be68baceba4afe69db1e68582e58d88e99bb0e5b69deba0b1e6a98be8b7a85e
UHC 뤯東慂午雰嶝렱橋括뤯東慂午雰嶝렱橋跨^ 10001111110111011101010011010100111010011011110111100111111011011101110111010100110101001111000110001110101111101100111011101001110011101100000010001111110111011101010011010100111010011011110111100111111011011101110111010100110101001111000110001110101111101100111011101001110011101010010101011110 8fddd4d4e9bde7edddd4d4f18ebecee9cec08fddd4d4e9bde7edddd4d4f18ebecee9cea55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)