To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??鏃???鏃?⑪血?忿基??趾???證?B 001111110011111111101000010101100011111100111111001111111110100001010110001111111000011101001010100011001000110000111111100111000111110010001010111011100011111100111111111001101110010000111111001111110011111111100110100110100011111101000010 3f3fe8563f3f3fe8563f874a8c8c3f9c7c8aee3f3fe6e43f3f3fe69a3f42
EUC-JP ??鏃???鏃??血?忿基??趾???證?B 0011111100111111111011111011011100111111001111110011111111101111101101110011111100111111101101111110110000111111110101111101110110110100111100000011111100111111111011001110011000111111001111110011111111101011111110100011111101000010 3f3fefb73f3f3fefb73f3fb7ec3fd7ddb4f03f3fece63f3f3febfa3f42
UTF-8 뤯훵鏃퐥띵였鏃퐥⑪血쳪忿基렰렧趾댓렰렓證렖B 11101011101001001010111111101101100110111011010111101001100011111000001111101101100100001010010111101011100111011011010111101100100110001000000011101001100011111000001111101101100100001010010111100010100100011010101011101000101000011000000011101100101100111010101011100101101111111011111111100101100111111011101011101011101000001011000011101011101000001010011111101000101101101011111011101011100011001001001111101011101000001011000011101011101000001001001111101000101011011000100111101011101000001001011001000010 eba4afed9bb5e98f83ed90a5eb9db5ec9880e98f83ed90a5e291aae8a180ecb3aae5bfbfe59fbaeba0b0eba0a7e8b6beeb8c93eba0b0eba093e8ad89eba09642
UHC 뤯훵鏃퐥띵였鏃퐥⑪血쳪忿基렰렧趾댓렰렓證렖B 10001111110111011100100011010000111100001110110010111101100011101011011011110010101111111011010011110000111011001011110110001110101010001111000111111010111011001010101110001111110111011100100011010000111100011000111010111101100011101011011011110010101111111011010011110001100011101011110110001110101010001111000111111011100011101010101101000010 8fddc8d0f0ecbd8eb6f2bfb4f0ecbd8ea8f1faecab8fddc8d0f18ebd8eb6f2bfb4f18ebd8ea8f1fb8eab42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)