To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???揖??油???ル“違??恂れ?冶 001111110011111100111111100101110100101100111111001111111001011011111011001111110011111100111111100000111000101110000001011001111000100011100001001111110011111110011100100101101000001011101010001111111001011011101000 3f3f3f974b3f3f96fb3f3f3f838b816788e13f3f9c9682ea3f96e8
EUC-JP ???揖??油???ル“違??恂れ?冶 001111110011111100111111110011011010110000111111001111111100110011111101001111110011111100111111101001011110101110100001110010001011000011100011001111110011111111010111111101101010010011101100001111111100110011101010 3f3f3fcdac3f3fccfd3f3f3fa5eba1c8b0e33f3fd7f6a4ec3fccea
UTF-8 嶺뚮슣揖㎩쳞油몄뒴曆ル“違깍쫳恂れ뫊冶 111011111010011010101011111010111001101010101110111011001000101010100011111001101000111110010110111000111000111010101001111011001011001110011110111001101011001010111001111010111010101010000100111010111001001010110100111011111010011010001011111000111000001110101011111000101000000010011100111010011000000110010101111010101011100110001101111011001010101110110011111001101000000110000010111000111000001010001100111010111010101110001010111001011000011010110110 efa6abeb9aaeec8aa3e68f96e38ea9ecb39ee6b2b9ebaa84eb92b4efa68be383abe2809ce98195eab98decabb3e68182e3828cebab8ae586b6
UHC 嶺뚮슣揖㎩쳞油몄뒴曆ル“違깍쫳恂れ뫊冶 1110011110101101100011001110101110011010101011111110101111100111101001111110010110101011100001001110101011111010101110001110110010001010101011011110011010110111101010111110101110100001101100001110101011011110101100011110111110100110100010111110001011100001101010101110110010010001101011001110010110100111 e7ad8ceb9aafebe7a7e5ab84eafab8ec8aade6b7abeba1b0eadeb1efa68be2e1aaec91ace5a7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)