What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣→?級????λ?藥??靭??碎?? 0011111100111111001111111000101110000011100000011010100000111111100010111000100100111111001111110011111100111111100000111100100100111111111001010101101000111111001111111001000001111000001111110011111111100001111010100011111100111111 3f3f3f8b8381a83f8b893f3f3f3f83c93fe55a3f3f90783f3fe1ea3f3f
EUC-JP ???泣→?級???洹λ?藥??靭??碎?? 00111111001111110011111110110101111000111010001010101010001111111011010111101001001111110011111100111111100011111100011110111010101001101100101100111111111010011011101100111111001111111011111111011001001111110011111111100010111011000011111100111111 3f3f3fb5e3a2aa3fb5e93f3f3f8fc7baa6cb3fe9bb3f3fbfd93f3fe2ec3f3f
UTF-8 捻꿔끇泣→쨫級吏녶뭣洹λ뤊藥띲꺁靭덃쾮碎좏맋 1110111110100110101001001110101010111111100101001110101110000001100001111110011010110011101000111110001010000110100100101110110010101000101010111110011110110100100110101110111110100111100111101110101110000101101101101110101110101101101000111110011010110100101110011100111010111011111010111010010010001010111010001001011110100101111010111001110110110010111010101011101010000001111010011001110110101101111010111000110110000011111011001011111010101110111001111010001010001110111011001010001010001111111010111010011110001011 efa6a4eabf94eb8187e6b3a3e28692eca8abe7b49aefa79eeb85b6ebada3e6b4b9cebbeba48ae897a5eb9db2eaba81e99dadeb8d83ecbeaee7a28eeca28feba78b
UHC 捻꿔끇泣→쨫級吏녶뭣洹λ뤊藥띲꺁靭덃쾮碎좏맋 1110011011110111101100101110001110000101101110111110101111101000101000011110011010100100100001011101000011100100111011001010011110000110111001011011100110111101111010101011011110100101111010111000111110111010111001011011011110001101111000111000001110101010111011001110010110001000111001101011001010000101111000011110111110100000111011011001000010100011 e6f7b2e385bbebe8a1e6a485d0e4eca786e5b9bdeab7a5eb8fbae5b78de383aaece588e6b285e1efa0ed90a3

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent appears in the table as "?" (0x3f).
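
A conversion like the one in the table can be approximated in Python. This is only a sketch, not the tool's actual implementation: the helper names (encode_bytes, to_binary, convert) are hypothetical, and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption. Unencodable characters are replaced with "?" (0x3f), which matches the runs of 3f seen in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_bytes(text, codec):
    """Encode text; characters the codec cannot represent become b'?' (0x3f)."""
    return text.encode(codec, errors="replace")


def to_binary(data):
    """Render each byte as 8 binary digits, concatenated without separators."""
    return "".join(f"{byte:08b}" for byte in data)


def convert(text):
    """Return {label: (binary string, hex string)} for every charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        data = encode_bytes(text, codec)
        rows[label] = (to_binary(data), data.hex())
    return rows


if __name__ == "__main__":
    # Only ASCII is printed: the label, the bit string, and the hex string.
    for label, (bits, hexa) in convert("→λ").items():
        print(label, bits, hexa)
```

For the input "→", this sketch yields e28692 for UTF-8, 81a8 for SJIS-WIN, a2aa for EUC-JP, and 3f for ISO-8859-1, which agrees with the corresponding bytes in the table above.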