To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????Þ??????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101111000111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3fde3f3f3f3f3f3f3f
SJIS-WIN ??????袁ъ?猥??遊??柔ロ?億 0011111100111111001111110011111100111111001111111110010111001101100001001000110000111111111000001100111000111111001111111001011101010110001111110011111110001111010111111000001110001101001111111000100110101101 3f3f3f3f3f3fe5cd848c3fe0ce3f3f97563f3f8f5f838d3f89ad
EUC-JP ???靷??袁ъ?猥?Þ遊??柔ロ?億 001111110011111100111111100011111110011110111101001111110011111111101010110011111010011111101100001111111110000011010000001111111000111110101001101100001100110110110111001111110011111110111101110000001010010111101101001111111011001010101111 3f3f3f8fe7bd3f3feacfa7ec3fe0d03f8fa9b0cdb73f3fbdc0a5ed3fb2af
UTF-8 嶺뚮뿫靷숂뙼袁ъ춷猥됰Þ遊꾤춯柔ロ닕億 11101111101001101010101111101011100110101010111011101011101111111010101111101001100111011011011111101100100010001000001011101011100110011011110011101000101000101000000111010001100010101110110010110110101101111110011110001100101001011110101110010000101100001100001110011110111010011000000110001010111010101011111010100100111011001011011010101111111001101001111110010100111000111000001110101101111010111000101110010101111001011000010010000100 efa6abeb9aaeebbfabe99db7ec8882eb99bce8a281d18aecb6b7e78ca5eb90b0c39ee9818aeabea4ecb6afe69f94e383adeb8b95e58484
UHC 嶺뚮뿫靷숂뙼袁ъ춷猥됰Þ遊꾤춯柔ロ닕億 1110011110101101100011001110101110010111101010111110110011100110100110011110011110001100101111111110101010111110101011001110110010101101100100111110100011100101100010011110101110101000101011011110101110110100100001001110011110101101100011001110101011110101101010111110110110001000100110011110010111100010 e7ad8ceb97abece699e78cbfeabeacecad93e8e589eba8adebb484e7ad8ceaf5abed8899e5e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)