To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?λ‘???儒?????愉ょ?濡??違 0011111110000011110010011000000101100101001111110011111100111111100011101111001000111111001111110011111100111111001111111001011011111001100000101110010100111111100101000100011100111111001111111000100011100001 3f83c981653f3f3f8ef23f3f3f3f3f96f982e53f94473f3f88e1
EUC-JP ?λ‘瑗??儒????Ŋ愉ょ?濡??違 001111111010011011001011101000011100011010001111110011001100000000111111001111111011110011110100001111110011111100111111001111111000111110101001101010111100110011111011101001001110011100111111110001111010100000111111001111111011000011100011 3fa6cba1c68fccc03f3fbcf43f3f3f3f8fa9abccfba4e73fc7a83f3fb0e3
UTF-8 列λ‘瑗삯렟儒쎌첌亮쇰Ŋ愉ょ뙠濡뗪퍊違 11101111101001101001110011001110101110111110001010000000100110001110011110010001100101111110110010000010101011111110101110100000100111111110010110000100100100101110110010001110100011001110110010110010100011001110111110100101101101111110110010000111101100001100010110001010111001101000010010001001111000111000001010000111111010111001100110100000111001101011111110100001111010111001011110101010111011011000110110001010111010011000000110010101 efa69ccebbe28098e79197ec82afeba09fe58492ec8e8cecb28cefa5b7ec87b0c58ae68489e38287eb99a0e6bfa1eb97aaed8d8ae98195
UHC 列λ‘瑗삯렟儒쎌첌亮쇰Ŋ愉ょ뙠濡뗪퍊違 1110011011101010101001011110101110100001101011101110101010111100101110111110100110001110101100001110101011100011101111011110110010101010100110011110010110111001101111001110101110101000101011111110101011110000101010101110011110001100101001011110101110100001100010111110101010111011100000011110101011011110 e6eaa5eba1aeeabcbbe98eb0eae3bdecaa99e5b9bceba8afeaf0aae78ca5eba18beabb81eade

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)