Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???靭??幼??壓??苡??惟????? 0011111100111111001111111001000001111000001111110011111110010111011000110011111100111111100110101101100000111111001111111110010010001111001111110011111110001000110100100011111100111111001111110011111100111111 3f3f3f90783f3f97633f3f9ad83f3fe48f3f3f88d23f3f3f3f3f
EUC-JP ???靭??幼??壓??苡??惟????? 0011111100111111001111111011111111011001001111110011111111001101110001000011111100111111110101001101101000111111001111111110011111101111001111110011111110110000110101000011111100111111001111110011111100111111 3f3f3fbfd93f3fcdc43f3fd4da3f3fe7ef3f3fb0d43f3f3f3f3f
UTF-8 麗몃쓷靭뚩땻幼먥뵦壓믩뜃苡김죲惟곕쭦歷몄쾿 111011111010011010001000111010111010101010000011111011001001001110110111111010011001110110101101111010111001101010101001111010111001010110111011111001011011100110111100111010111010100010100101111010111011010110100110111001011010001110010011111010111010111110101001111010111001110010000011111010001000101110100001111010101011100110000000111011001010001110110010111001101000001110011111111010101011001110010101111011001010110110100110111011111010011010001100111010111010101010000100111011001011111010111111 efa688ebaa83ec93b7e99dadeb9aa9eb95bbe5b9bceba8a5ebb5a6e5a393ebafa9eb9c83e88ba1eab980eca3b2e6839feab395ecada6efa68cebaa84ecbebf
UHC 麗몃쓷靭뚩땻幼먥뵦壓믩뜃苡김죲惟곕쭦歷몄쾿 111001101011000010111000111010111001110110010100111011001110010110001100111010001000101110010001111010101110101010010000111000101001010010100101111001001110001010010010111010111000110110000111111011001011111010110001111010001010000110001101111010101110111010110000111010111010011110011010111001101011100010111000111011001011001010010101 e6b0b8eb9d94ece58ce88b91eaea90e294a5e4e292eb8d87ecbeb1e8a18deaeeb0eba79ae6b8b8ecb295

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
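
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about which Python codecs correspond most closely to the tool's charsets, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is Python's closest codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset; returns {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("靭").items():
        print(label, bits, hexstr)
```

For the single character 靭, the table above lists `9078` under SJIS-WIN, `bfd9` under EUC-JP, and `e99dad` under UTF-8. The sketch should print the same byte values if Python's codecs agree with the tool's conversion tables for that character.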