To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??)????ャ?勇??肄ょ?猷??堰 0011111100111111100000010110101000111111001111110011111100111111100000111000001100111111100101110100010100111111001111111110001111100101100000101110010100111111100101110101000100111111001111111000100110000001 3f3f816a3f3f3f3f83833f97453f3fe3e582e53f97513f3f8981
EUC-JP ??)絪???ャ?勇??肄ょ?猷??堰 00111111001111111010000111001011100011111101001111101100001111110011111100111111101001011110001100111111110011011010011000111111001111111110011011100111101001001110011100111111110011011011001000111111001111111011000111100001 3f3fa1cb8fd3ec3f3f3fa5e33fcda63f3fe6e7a4e73fcdb23f3fb1e1
UTF-8 念잙)絪밭넭戮ャ궕勇싳떑肄ょ삜猷⑺겧堰 111011111010011010100011111011001001111010011001111011111011110010001001111001111011010110101010111010111011000010101101111010111000010010101101111011111010011110010010111000111000001110100011111010101011011010010101111001011000101110000111111011001000101110110011111010111001011010010001111010001000001010000100111000111000001010000111111011001000001010011100111001111000110010110111111000101001000110111010111010101011001010100111111001011010000010110000 efa6a3ec9e99efbc89e7b5aaebb0adeb84adefa792e383a3eab695e58b87ec8bb3eb9691e88284e38287ec829ce78cb7e291baeab2a7e5a0b0
UHC 念잙)絪밭넭戮ャ궕勇싳떑肄ょ삜猷⑺겧堰 1110011011110110100111111110101110100011101010011110110011011111101110011110011110000110101011001110101110111101101010111110001110000010101010101110100110111000100110101110110010001011101001111110110010111101101010101110011110011000100111111110101110100011101010011110110110000001101110011110010111101000 e6f69feba3a9ecdfb9e786acebbdabe382aae9b89aec8ba7ecbdaae7989feba3a9ed81b9e5e8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)