What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俑??萸??鎖???ル?喩??艶k∥猷 10011000110110100011111100111111111001001100111000111111001111111000110110111101001111110011111100111111100000111000101100111111100110100110011100111111001111111000100110010000100000101000101110000001011000011001011101010001 98da3f3fe4ce3f3f8dbd3f3f3f838b3f9a673f3f8990828b81619751
EUC-JP 俑??萸??鎖???ル?喩??艶k‖猷 11010000110111000011111100111111111010001101000000111111001111111011101010111111001111110011111100111111101001011110101100111111110100111100100000111111001111111011000111110000101000111110101110100001110000101100110110110010 d0dc3f3fe8d03f3fbabf3f3f3fa5eb3fd3c83f3fb1f0a3eba1c2cdb2
UTF-8 俑앹늿萸썸뤃鎖듬겱曆ル릧喩볦쓤艶k∥猷 111001001011111110010001111011001001010110111001111010111000101010111111111010001001000010111000111011001000110110111000111010111010010010000011111010011000111010010110111010111001001110101100111010101011001010110001111011111010011010001011111000111000001110101011111010111010011010100111111001011001011010101001111010111011001110100110111011001001001110100100111010001000100110110110111011111011110110001011111000101000100010100101111001111000110010110111 e4bf91ec95b9eb8abfe890b8ec8db8eba483e98e96eb93aceab2b1efa68be383abeba6a7e596a9ebb3a6ec93a4e889b6efbd8be288a5e78cb7
UHC 俑앹늿萸썸뤃鎖듬겱曆ル릧喩볦쓤艶k∥猷 1110100110110101100111011110110010001000100010001110101110101101101111011110011010001111101101001110000111110000101101011110101110000001101111011110011010110111101010111110101110010000100010011110101011100111100100111110110010011101100001011110011011111101101000111110101110100001101010111110101110100011 e9b59dec8888ebadbde68fb4e1f0b5eb81bde6b7abeb9089eae793ec9d85e6fda3eba1abeba3

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
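
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the page's actual implementation. The mapping from the page's charset labels to Python codec names is an assumption (in particular, UHC is taken to be `cp949`), and `encode_row` and `encode_table` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which resembles the `3f` bytes seen in the table above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

Using `errors="replace"` instead of the default strict mode keeps the sketch from raising an exception on unencodable input, at the cost of losing information in the output.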