To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 沃??毅??珥?い怨?????B 100101111000000000111111001111111000101101000010001111110011111111100000111000000011111110000010101000101000100110000101001111110011111100111111001111110011111101000010 97803f3f8b423f3fe0e03f82a289853f3f3f3f3f42
EUC-JP 沃??毅??珥?い怨??孼??B 1100110111100000001111110011111110110101101000110011111100111111111000001110001000111111101001001010010010110001111001010011111100111111100011111011101011000011001111110011111101000010 cde03f3fb5a33f3fe0e23fa4a4b1e53f3f8fbac33f3f42
UTF-8 沃ㅺ낯毅싨룚珥껇い怨몄삖孼뽰큳B 11100110101100101000001111100011100001011011101011101011100000101010111111100110101011111000010111101100100010111010100011101011101000111001101011100111100011111010010111101010101110111000011111100011100000011000010011100110100000001010100011101011101010101000010011101100100000101001011011100101101011011011110011101011101111011011000011101101100000011011001101000010 e6b283e385baeb82afe6af85ec8ba8eba39ae78fa5eabb87e38184e680a8ebaa84ec8296e5adbcebbdb0ed81b342
UHC 沃ㅺ낯毅싨룚珥껇い怨몄삖孼뽰큳B 11101000101010101010010011101010101100111011100011101011111101101001101011100110100011111001011011101100101101001000001111101000101010101010010011101010101100111011100011101100100110001001101011100101111011011001011011101100101101001000001101000010 e8aaa4eab3b8ebf69ae68f96ecb483e8aaa4eab3b8ec989ae5ed96ecb48342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)