Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 憶??違?????[憶??違?????[^ 10001001101011110011111100111111100010001110000100111111001111110011111100111111001111110101101110001001101011110011111100111111100010001110000100111111001111110011111100111111001111110101101101011110 89af3f3f88e13f3f3f3f3f5b89af3f3f88e13f3f3f3f3f5b5e
EUC-JP 憶??違??洹??[憶??違??洹??[^ 1011001010110001001111110011111110110000111000110011111100111111100011111100011110111010001111110011111101011011101100101011000100111111001111111011000011100011001111110011111110001111110001111011101000111111001111110101101101011110 b2b13f3fb0e33f3f8fc7ba3f3f5bb2b13f3fb0e33f3f8fc7ba3f3f5b5e
UTF-8 憶귣ㅈ違방떤洹잙젚[憶귣ㅈ違방떤洹잙젚[^ 111001101000011010110110111010101011011110100011111000111000010110001000111010011000000110010101111010111011000010101001111010111001011010100100111001101011010010111001111011001001111010011001111011001010000010011010010110111110011010000110101101101110101010110111101000111110001110000101100010001110100110000001100101011110101110110000101010011110101110010110101001001110011010110100101110011110110010011110100110011110110010100000100110100101101101011110 e686b6eab7a3e38588e98195ebb0a9eb96a4e6b4b9ec9e99eca09a5be686b6eab7a3e38588e98195ebb0a9eb96a4e6b4b9ec9e99eca09a5b5e
UHC 憶귣ㅈ違방떤洹잙젚[憶귣ㅈ違방떤洹잙젚[^ 111001011110001110000010111010111010010010111000111010101101111010111001111001101011011010110010111010101011011110011111111010111010000010010110010110111110010111100011100000101110101110100100101110001110101011011110101110011110011010110110101100101110101010110111100111111110101110100000100101100101101101011110 e5e382eba4b8eadeb9e6b6b2eab79feba0965be5e382eba4b8eadeb9e6b6b2eab79feba0965b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
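
The same kind of conversion can be reproduced offline. The following is a minimal Python sketch, not the code behind this page. The function name `encode_report` is hypothetical, and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is what the table output above appears to do.

```python
# Sketch: encode one string in several character sets and show the
# resulting bytes as a binary bit string and as hexadecimal.
# Codec names are Python's; their mapping to the page's labels is assumed.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_report(text, charsets=CHARSETS):
    """Return a list of (charset, bit_string, hex_string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in encode_report("憶"):
        print(name, bits, hex_string)
```

For the character 憶, this sketch yields e686b6 for UTF-8, 89af for cp932, and b2b1 for EUC-JP, which matches the leading bytes of the corresponding rows in the table.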