To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 癲?????衣??矣??熱??B 1110000110011111001111110011111100111111001111110011111110001000110111110011111100111111111000011110000100111111001111111001010001001101001111110011111101000010 e19f3f3f3f3f3f88df3f3fe1e13f3f944d3f3f42
EUC-JP 癲?????衣??矣??熱??B 1110001010100001001111110011111100111111001111110011111110110000111000010011111100111111111000101110001100111111001111111100011110101110001111110011111101000010 e2a13f3f3f3f3fb0e13f3fe2e33f3fc7ae3f3f42
UTF-8 癲띿슜杻⑴뮫衣㏆쫶矣⑸뼦熱듭윧B 11100111100110011011001011101011100111011011111111101100100010101001110011101111101001111000100011100010100100011011010011101011101011101010101111101000101000011010001111100011100011111000011011101100101010111011011011100111100111111010001111100010100100011011100011101011101111001010011011100111100001101011000111101011100100111010110111101100100111001010011101000010 e799b2eb9dbfec8a9cefa788e291b4ebaeabe8a1a3e38f86ecabb6e79fa3e291b8ebbca6e786b1eb93adec9ca742
UHC 癲띿슜杻⑴뮫衣㏆쫶矣⑸뼦熱듭윧B 11101111101001101000110111101100100110101010100111101010111101001010100111100111100100101011010111101011111111011010011111101111101001101000110111101011111110001010100111101011100101101010100111100110111100001011010111101100100111111010011101000010 efa68dec9aa9eaf4a9e792b5ebfda7efa68debf8a9eb96a9e6f0b5ec9fa742
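The conversion shown in the table can be approximated offline. The following is only a minimal sketch, not the tool's actual implementation: the function name `encode_report` is hypothetical, and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, so individual bytes may differ from the tool's output for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the table.

```python
# Sketch: show how a string is encoded in several character sets,
# as a binary bit string and as a hexadecimal string.

# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text, charsets=CHARSETS):
    """Return a list of (label, bit string, hex string) tuples for text."""
    rows = []
    for label, codec in charsets.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_report("B"):
        print(label, bits, hexstr)
```

Joining the 8-bit form of every byte gives the binary column, and `bytes.hex()` gives the hexadecimal column, so the two always describe the same byte sequence.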

SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).