To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 髫俶ァォ豸オ隴エ謚訴髫俶ァォ豸オ隴エ謚訴B 111010011001101010011000111001101010011110101011111001101011011010110101111010001010110110110100111001101000101010010001011010011110100110011010100110001110011010100111101010111110011010110110101101011110100010101101101101001110011010001010100100010110100101000010 e99a98e6a7abe6b6b5e8adb4e68a9169e99a98e6a7abe6b6b5e8adb4e68a916942
EUC-JP 髫俶ァォ豸オ隴エ謚訴髫俶ァォ豸オ隴エ謚訴B 1111000111111010110100001110100010001110101001111000111010101011111011001011100010001110101101011111000010101111100011101011010011101011111010101100000111001010111100011111101011010000111010001000111010100111100011101010101111101100101110001000111010110101111100001010111110001110101101001110101111101010110000011100101001000010 f1fad0e88ea78eabecb88eb5f0af8eb4ebeac1caf1fad0e88ea78eabecb88eb5f0af8eb4ebeac1ca42
UTF-8 髫俶ァォ豸オ隴エ謚訴髫俶ァォ豸オ隴エ謚訴B 11101001101010111010101111100100101111111011011011101111101111011010011111101111101111011010101111101000101100011011100011101111101111011011010111101001100110101011010011101111101111011011010011101000101011001001101011101000101010001011010011101001101010111010101111100100101111111011011011101111101111011010011111101111101111011010101111101000101100011011100011101111101111011011010111101001100110101011010011101111101111011011010011101000101011001001101011101000101010001011010001000010 e9ababe4bfb6efbda7efbdabe8b1b8efbdb5e99ab4efbdb4e8ac9ae8a8b4e9ababe4bfb6efbda7efbdabe8b1b8efbdb5e99ab4efbdb4e8ac9ae8a8b442
UHC ????????謚訴????????謚訴B 00111111001111110011111100111111001111110011111100111111001111111110110011010000111000011100110100111111001111110011111100111111001111110011111100111111001111111110110011010000111000011100110101000010 3f3f3f3f3f3f3f3fecd0e1cd3f3f3f3f3f3f3f3fecd0e1cd42
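
The conversion shown in the table can be approximated with a minimal Python sketch. The function names `encode_row` and `encode_table` are hypothetical, and the codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be Python's closest equivalents to the charset labels above; they may not match the tool's converter byte for byte. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary, hexadecimal) bit strings for text in one codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("B").items():
        print(label, binary, hexadecimal)
```

For the ASCII letter "B", every charset listed here yields the same single byte, which is why each row of the table ends in 01000010 (hexadecimal 42).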

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).