What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壓??矣??厄μ?鍮??癒ш?阿??愉 10011010110110000011111100111111111000011110000100111111001111111001011011101111100000111100101000111111111010000100101000111111001111111001011011111100100001001000101000111111100010001010001000111111001111111001011011111001 9ad83f3fe1e13f3f96ef83ca3fe84a3f3f96fc848a3f88a23f3f96f9
EUC-JP 壓??矣??厄μ?鍮??癒ш?阿??愉 11010100110110100011111100111111111000101110001100111111001111111100110011110001101001101100110000111111111011111010101100111111001111111100110011111110101001111110101000111111101100001010010000111111001111111100110011111011 d4da3f3fe2e33f3fccf1a6cc3fefab3f3fccfea7ea3fb0a43f3fccfb
UTF-8 壓쇰꼬矣곕뮟厄μ떝鍮€룚癒ш굉阿숈눘愉 11100101101000111001001111101100100001111011000011101010101111001010110011100111100111111010001111101010101100111001010111101011101011101001111111100101100011101000010011001110101111001110101110010110100111011110100110001101101011101110001010000010101011001110101110100011100110101110011110011001100100101101000110001000111010101011010110001001111010011001100010111111111011001000100010001000111010111000100010011000111001101000010010001001 e5a393ec87b0eabcace79fa3eab395ebae9fe58e84cebceb969de98daee282aceba39ae79992d188eab589e998bfec8888eb8898e68489
UHC 壓쇰꼬矣곕뮟厄μ떝鍮€룚癒ш굉阿숈눘愉 1110010011100010101111001110101110110010101111111110101111111000101100001110101110010010101010111110010011111000101001011110110010001011101100111110101110111001101000101110011010001111100101101110101110101000101011001110101010110001101100101110010010111001100110011110110010000111101100011110101011110000 e4e2bcebb2bfebf8b0eb92abe4f8a5ec8bb3ebb9a2e68f96eba8aceab1b2e4b999ec87b1eaf0

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
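
The conversion shown in the table can be approximated with a short Python sketch: encode the input in each charset, then print the resulting bytes as a binary string and a hexadecimal string. This is not the page's actual implementation. The function name encode_table, the Python codec names (cp932 standing in for SJIS-WIN, cp949 for UHC), and the choice to replace unencodable characters with "?" (0x3f) are assumptions made to mirror the table's behavior.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# cp932 for SJIS-WIN and cp949 for UHC are assumptions, not taken from the page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, round-tripped text, binary string, hex string) rows."""
    rows = []
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent become "?" (byte 0x3f).
        raw = text.encode(codec, errors="replace")
        # Each byte is written as exactly 8 binary digits.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, raw.decode(codec), bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("μ"):
        print(label, shown, bits, hexstr)
```

For the input "μ", the UTF-8 row should read cebc, which matches the bytes for that character in the table above. Small differences from the table are possible for other characters, since codec mappings vary slightly between implementations.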