To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 額??韋??純??馭??誼→?攸??沃 100010100111101000111111001111111110100011101000001111110011111110001111100000110011111100111111111010010110011000111111001111111000101101100010100000011010100000111111100111011011111100111111001111111001011110000000 8a7a3f3fe8e83f3f8f833f3fe9663f3f8b6281a83f9dbf3f3f9780
EUC-JP 額??韋??純??馭??誼→?攸??沃 101100111101101100111111001111111111000011101010001111110011111110111101111000110011111100111111111100011100011100111111001111111011010111000011101000101010101000111111110110101100000100111111001111111100110111100000 b3db3f3ff0ea3f3fbde33f3ff1c73f3fb5c3a2aa3fdac13f3fcde0
UTF-8 額곗눖韋륅쭏純놁춷馭귙꺃誼→퐗攸됲맊沃 111010011010000110001101111010101011001110010111111010111000100010010110111010011001111110001011111010111010010110000101111011001010110110001111111001111011010010010100111010111000011010000001111011001011011010110111111010011010011010101101111010101011011110011001111010101011101010000011111010001010101010111100111000101000011010010010111011011001000010010111111001101001010010111000111010111001000010110010111010111010011110001010111001101011001010000011 e9a18deab397eb8896e99f8beba585ecad8fe7b494eb8681ecb6b7e9a6adeab799eaba83e8aabce28692ed9097e694b8eb90b2eba78ae6b283
UHC 額곗눖韋륅쭏純놁춷馭귙꺃誼→퐗攸됲맊沃 1110010011111110101100001110110010000111101100001110101011011111100011111110111110100111100010001110001011101101100001101110110010101101100100111110010111011111100000101110001110000011101011001110101111111110101000011110011010111101100000011110101011110010100010011110110110010000101000101110100010101010 e4feb0ec87b0eadf8fefa788e2ed86ecad93e5df82e383acebfea1e6bd81eaf289ed90a2e8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)