What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 岳??泣??攸??癲??泣???⑥?哀 1000101001111000001111110011111110001011100000110011111100111111100111011011111100111111001111111110000110011111001111110011111110001011100000110011111100111111001111111000011101000101001111111000100010100011 8a783f3f8b833f3f9dbf3f3fe19f3f3f8b833f3f3f87453f88a3
EUC-JP 岳??泣??攸??癲??泣?ˇ洹??哀 1011001111011001001111110011111110110101111000110011111100111111110110101100000100111111001111111110001010100001001111110011111110110101111000110011111110001111101000101011000010001111110001111011101000111111001111111011000010100101 b3d93f3fb5e33f3fdac13f3fe2a13f3fb5e33f8fa2b08fc7ba3f3fb0a5
UTF-8 岳됰냲泣앮뿿攸됱뒃癲삳낑泣됪ˇ洹⑥돖哀 1110010110110010101100111110101110010000101100001110101110000011101100101110011010110011101000111110110010010101101011101110101110111111101111111110011010010100101110001110101110010000101100011110101110010010100000111110011110011001101100101110110010000010101100111110101110000010100100011110011010110011101000111110101110010000101010101100101110000111111001101011010010111001111000101001000110100101111010111000111110010110111001011001001110000000 e5b2b3eb90b0eb83b2e6b3a3ec95aeebbfbfe694b8eb90b1eb9283e799b2ec82b3eb8291e6b3a3eb90aacb87e6b4b9e291a5eb8f96e59380
UHC 岳됰냲泣앮뿿攸됱뒃癲삳낑泣됪ˇ洹⑥돖哀 1110010010111111100010011110101110000110100000101110101111101000100111011110011010010111101111111110101011110010100010011110110010001010100000011110111110100110101110111110101110110011101010011110101111101000100010011110011010100010101001111110101010110111101010001110110010001001101000001110010011101110 e4bf89eb8682ebe89de697bfeaf289ec8a81efa6bbebb3a9ebe889e6a2a7eab7a8ec89a0e4ee

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
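
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the mapping from the table's charset labels to Python codec names (for example SJIS-Win to cp932 and UHC to cp949) is an assumption, and the function name encode_all is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which would explain the runs of 3f seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumption: SJIS-Win corresponds to code page 932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_all(text):
    """Return {label: (binary string, hex string)} for each charset.

    Hypothetical helper for illustration only.
    """
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexa) in encode_all("あ").items():
        print(label, bits, hexa)
```

Passing the same string through every codec makes it easy to compare byte lengths across charsets: a character that takes one byte in ISO-8859-1 may take two in SJIS-Win or EUC-JP and three in UTF-8.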