What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 儼??泣??懿?????揖?????円?〕揖 1001100101010110001111110011111110001011100000110011111100111111100111001111001000111111001111110011111100111111001111111001011101001011001111110011111100111111001111110011111110001001011111100011111110000001011011001001011101001011 99563f3f8b833f3f9cf23f3f3f3f3f974b3f3f3f3f3f897e3f816c974b
EUC-JP 儼??泣??懿?????揖?????円?〕揖 1101000110110111001111110011111110110101111000110011111100111111110110001111010000111111001111110011111100111111001111111100110110101100001111110011111100111111001111110011111110110001110111110011111110100001110011011100110110101100 d1b73f3fb5e33f3fd8f43f3f3f3f3fcdac3f3f3f3f3fb1df3fa1cdcdac
UTF-8 儼볥슁泣덃퓴懿얠돱黎싲냵揖썸뮄戮녹뒋円ㅻ〕揖 111001011000010010111100111010111011001110100101111011001000101010000001111001101011001110100011111010111000110110000011111011011001001110110100111001101000011110111111111011001001011010100000111010111000111110110001111011111010011010001001111011001000101110110010111010111000001110110101111001101000111110010110111011001000110110111000111010111010111010000100111011111010011110010010111010111000010110111001111010111001001010001011111001011000011010000110111000111000010110111011111000111000000010010101111001101000111110010110 e584bcebb3a5ec8a81e6b3a3eb8d83ed93b4e687bfec96a0eb8fb1efa689ec8bb2eb83b5e68f96ec8db8ebae84efa792eb85b9eb928be58686e385bbe38095e68f96
UHC 儼볥슁泣덃퓴懿얠돱黎싲냵揖썸뮄戮녹뒋円ㅻ〕揖 1110010111110000100100111110101110111101101100111110101111101000100010001110011010111111100110101110101111110011101111101110110010001001101101001110011010110001100110101110101110000110100001011110101111100111101111011110011010010010100100111110101110111101101100111110110010001010100010001110010111110111101001001110101110100001101100111110101111100111 e5f093ebbdb3ebe888e6bf9aebf3beec89b4e6b19aeb8685ebe7bde69293ebbdb3ec8a88e5f7a4eba1b3ebe7

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
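
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's own implementation. The mapping from the page's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters a charset cannot represent are replaced with "?", which matches the 3f bytes in the table, though individual mappings may differ slightly from the converter used here.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in encode_table("円").items():
        print(label, binary, hexa)
```

For the single character 円, this sketch yields e58686 for UTF-8, 897e for SJIS-WIN, and b1df for EUC-JP, which agrees with the corresponding bytes in the table above.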