To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鳶??維??凝??儒??裔??B 100100111100111000111111001111111000100011011011001111110011111110001011110000110011111100111111100011101111001000111111001111111110010111100001001111110011111101000010 93ce3f3f88db3f3f8bc33f3f8ef23f3fe5e13f3f42
EUC-JP 鳶??維??凝??儒??裔??B 110001101101000000111111001111111011000011011101001111110011111110110110110001010011111100111111101111001111010000111111001111111110101011100011001111110011111101000010 c6d03f3fb0dd3f3fb6c53f3fbcf43f3feae33f3f42
UTF-8 鳶롫뀽維믦굜凝사땸儒노쳮裔꾩뙸B 11101001101100111011011011101011101000011010101111101011100000001011110111100111101101101010110111101011101011111010011011101010101101011001110011100101100001111001110111101100100000101010110011101011100101011011100011100101100001001001001011101011100001011011100011101100101100111010111011101000101000111001010011101010101111101010100111101011100110011011100001000010 e9b3b6eba1abeb80bde7b6adebafa6eab59ce5879dec82aceb95b8e58492eb85b8ecb3aee8a394eabea9eb99b842
UHC 鳶롫뀽維믦굜凝사땸儒노쳮裔꾩뙸B 11100110111010011000111011101011100001011011001111101011101010111001001011101000100000101000010011101011111010101011101111100111100010111000111011101010111000111011001111101011101010111001001011100111111000001000010011101100100011001011101101000010 e6e98eeb85b3ebab92e88284ebeabbe78b8eeae3b3ebab92e7e084ec8cbb42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)