What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??松⑨?檍??猷ロ?受k6韋??? 1001011101010001001111110011111110001111101111001000011101001000001111111001111011111000001111110011111110010111010100011000001110001101001111111000111011110011100000101000101110000010010101011110100011101000001111110011111100111111 97513f3f8fbc87483f9ef83f3f9751838d3f8ef3828b8255e8e83f3f3f
EUC-JP 猷??松??檍??猷ロ?受k6韋??? 11001101101100100011111100111111101111101011111000111111001111111101110011111010001111110011111111001101101100101010010111101101001111111011110011110101101000111110101110100011101101101111000011101010001111110011111100111111 cdb23f3fbebe3f3fdcfa3f3fcdb2a5ed3fbcf5a3eba3b6f0ea3f3f3f
UTF-8 猷띠툞松⑨쫩檍덀궠猷ロ삨受k6韋쇘왋理 111001111000110010110111111010111001110110100000111011011000100010011110111001101001110110111110111000101001000110101000111011001010101110101001111001101010101010001101111010111000110110000000111010101011011010100000111001111000110010110111111000111000001110101101111011001000001010101000111001011000111110010111111011111011110110001011111011111011110010010110111010011001111110001011111011001000011110011000111011001001100110001011111011111010011110100100 e78cb7eb9da0ed889ee69dbee291a8ecaba9e6aa8deb8d80eab6a0e78cb7e383adec82a8e58f97efbd8befbc96e99f8bec8798ec998befa7a4
UHC 猷띠툞松⑨쫩檍덀궠猷ロ삨受k6韋쇘왋理 1110101110100011101101101110110010111000100101011110000111100110101010001110111110100110100000101110010111100101100010001110001110000010101100111110101110100011101010111110110110011000101001111110000111110100101000111110101110100011101101101110101011011111101111001110011110011110101111001110110010110101 eba3b6ecb895e1e6a8efa682e5e588e382b3eba3abed98a7e1f4a3eba3b6eadfbce79ebcecb5

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
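
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's actual implementation. The function name `encode_all` and the mapping of table labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration, and unrepresentable characters are replaced with "?" to mirror the table's behavior.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# The codec choices (cp932 for SJIS-WIN, cp949 for UHC) are assumptions.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_all("松"):
        print(label, bits, hex_string)
```

For the single character 松, this sketch should give the same byte values that appear for that character in the table rows above (for example, `e69dbe` in UTF-8 and `bebe` in EUC-JP).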