To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????R????^[????R????^[^ 0011111100111111001111110011111101010010001111110011111100111111001111110101111001011011001111110011111100111111001111110101001000111111001111110011111100111111010111100101101101011110 3f3f3f3f523f3f3f3f5e5b3f3f3f3f523f3f3f3f5e5b5e
SJIS-WIN 艤???R艤???^[艤???R艤???^[^ 111001000111111000111111001111110011111101010010111001000111111000111111001111110011111101011110010110111110010001111110001111110011111100111111010100101110010001111110001111110011111100111111010111100101101101011110 e47e3f3f3f52e47e3f3f3f5e5be47e3f3f3f52e47e3f3f3f5e5b5e
EUC-JP 艤???R艤???^[艤???R艤???^[^ 111001111101111100111111001111110011111101010010111001111101111100111111001111110011111101011110010110111110011111011111001111110011111100111111010100101110011111011111001111110011111100111111010111100101101101011110 e7df3f3f3f52e7df3f3f3f5e5be7df3f3f3f52e7df3f3f3f5e5b5e
UTF-8 艤쇼렩렡R艤쇼렩렡^[艤쇼렩렡R艤쇼렩렡^[^ 11101000100010011010010011101100100001111011110011101011101000001010100111101011101000001010000101010010111010001000100110100100111011001000011110111100111010111010000010101001111010111010000010100001010111100101101111101000100010011010010011101100100001111011110011101011101000001010100111101011101000001010000101010010111010001000100110100100111011001000011110111100111010111010000010101001111010111010000010100001010111100101101101011110 e889a4ec87bceba0a9eba0a152e889a4ec87bceba0a9eba0a15e5be889a4ec87bceba0a9eba0a152e889a4ec87bceba0a9eba0a15e5b5e
UHC 艤쇼렩렡R艤쇼렩렡^[艤쇼렩렡R艤쇼렩렡^[^ 111010111111101010111100111011101000111010110111100011101011001001010010111010111111101010111100111011101000111010110111100011101011001001011110010110111110101111111010101111001110111010001110101101111000111010110010010100101110101111111010101111001110111010001110101101111000111010110010010111100101101101011110 ebfabcee8eb78eb252ebfabcee8eb78eb25e5bebfabcee8eb78eb252ebfabcee8eb78eb25e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)