To what bit string is a character (or several characters) encoded in each character set?

Enter a single character or a short string and click "Convert".


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 靖???烝???紙?靖???烝???紙?B 100101101111010100111111001111110011111111100000011111100011111100111111001111111000111010000110001111111001011011110101001111110011111100111111111000000111111000111111001111110011111110001110100001100011111101000010 96f53f3f3fe07e3f3f3f8e863f96f53f3f3fe07e3f3f3f8e863f42
EUC-JP 靖???烝???紙?靖???烝???紙?B 110011001111011100111111001111110011111111011111110111110011111100111111001111111011101111100110001111111100110011110111001111110011111100111111110111111101111100111111001111110011111110111011111001100011111101000010 ccf73f3f3fdfdf3f3f3fbbe63fccf73f3f3fdfdf3f3f3fbbe63f42
UTF-8 靖ㆁ렰렕烝렱吏렋紙렔靖ㆁ렰렕烝렱吏렋紙렔B 11101001100111011001011011100011100001101000000111101011101000001011000011101011101000001001010111100111100000111001110111101011101000001011000111101111101001111001111011101011101000001000101111100111101101001001100111101011101000001001010011101001100111011001011011100011100001101000000111101011101000001011000011101011101000001001010111100111100000111001110111101011101000001011000111101111101001111001111011101011101000001000101111100111101101001001100111101011101000001001010001000010 e99d96e38681eba0b0eba095e7839deba0b1efa79eeba08be7b499eba094e99d96e38681eba0b0eba095e7839deba0b1efa79eeba08be7b499eba09442
UHC 靖ㆁ렰렕烝렱吏렋紙렔靖ㆁ렰렕烝렱吏렋紙렔B 1110111111111110101001001111000110001110101111011000111010101010111100011111011010001110101111101110110010100111100011101010001011110010101101011000111010101001111011111111111010100100111100011000111010111101100011101010101011110001111101101000111010111110111011001010011110001110101000101111001010110101100011101010100101000010 effea4f18ebd8eaaf1f68ebeeca78ea2f2b58ea9effea4f18ebd8eaaf1f68ebeeca78ea2f2b58ea942

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, SJIS-Win (= CP932) on Windows and EUC-JP on UNIX.
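
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the tool's actual implementation. The function name bit_strings and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is presumably the source of the runs of 3f in the table.

```python
# Minimal sketch: encode a string in several charsets and show the
# resulting bytes as a binary bit string and as hexadecimal.

# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Unencodable characters are replaced with "?" instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "\u9756B"  # the character 靖 followed by the letter B
    for label, codec in CHARSETS.items():
        binary, hexadecimal = bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is always printed as eight bits, so the binary column is exactly four times as long as the hexadecimal column, as in the table above.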