What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣→?級援??碎??藥?????遺?? 0011111100111111001111111000101110000011100000011010100000111111100010111000100110001001100001110011111100111111111000011110101000111111001111111110010101011010001111110011111100111111001111110011111110001000111000100011111100111111 3f3f3f8b8381a83f8b8989873f3fe1ea3f3fe55a3f3f3f3f3f88e23f3f
EUC-JP ???泣→?級援??碎??藥?????遺?? 0011111100111111001111111011010111100011101000101010101000111111101101011110100110110001111001110011111100111111111000101110110000111111001111111110100110111011001111110011111100111111001111110011111110110000111001000011111100111111 3f3f3fb5e3a2aa3fb5e9b1e73f3fe2ec3f3fe9bb3f3f3f3f3fb0e43f3f
UTF-8 捻꿔끇泣→쨫級援쒙쭜碎밸뤊藥띲꺂理묊땟遺밴강 111011111010011010100100111010101011111110010100111010111000000110000111111001101011001110100011111000101000011010010010111011001010100010101011111001111011010010011010111001101000111110110100111011001001001010011001111011001010110110011100111001111010001010001110111010111011000010111000111010111010010010001010111010001001011110100101111010111001110110110010111010101011101010000010111011111010011110100100111010111010110010001010111010111001010110011111111010011000000110111010111010111011000010110100111010101011000010010101 efa6a4eabf94eb8187e6b3a3e28692eca8abe7b49ae68fb4ec9299ecad9ce7a28eebb0b8eba48ae897a5eb9db2eaba82efa7a4ebac8aeb959fe981baebb0b4eab095
UHC 捻꿔끇泣→쨫級援쒙쭜碎밸뤊藥띲꺂理묊땟遺밴강 1110011011110111101100101110001110000101101110111110101111101000101000011110011010100100100001011101000011100100111010101011010110011100111011111010011110010010111000011110111110111001111010111000111110111010111001011011011110001101111000111000001110101011111011001011010110010001111001111011011010101101111010111011011010111001111010101011000010101101 e6f7b2e385bbebe8a1e6a485d0e4eab59cefa792e1efb9eb8fbae5b78de383abecb591e7b6adebb6b9eab0ad

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hexadecimal 3f) in the table.
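
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_to_bits` and the `CHARSETS` mapping are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed to be close equivalents of the charsets listed here, so a few edge-case characters may map differently. Unencodable characters are replaced with "?" to mirror the 3f bytes in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 ~ SJIS-WIN and cp949 ~ UHC; edge cases may differ.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) of `text` encoded with `codec`.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "→"  # U+2192, one of the characters in the table above
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_to_bits(sample, codec)
        print(label, bits, hex_string)
```

For the single character "→", this yields 3f in ISO-8859-1 (not representable), 81a8 in SJIS-WIN, a2aa in EUC-JP, e28692 in UTF-8, and a1e6 in UHC, which matches the corresponding byte pairs in the table.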