To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 癲щ?誼?━喩??B 111000011001111110000100100010110011111110001011011000100011111110000100101010101001101001100111001111110011111101000010 e19f848b3f8b623f84aa9a673f3f42
EUC-JP 癲щ?誼?━喩??B 111000101010000110100111111010110011111110110101110000110011111110101000101011001101001111001000001111110011111101000010 e2a1a7eb3fb5c33fa8acd3c83f3f42
UTF-8 癲щ돆誼믭━喩쀬젔B 111001111001100110110010110100011000100111101011100011111000011011101000101010101011110011101011101011111010110111100010100101001000000111100101100101101010100111101100100000001010110011101100101000001001010001000010 e799b2d189eb8f86e8aabcebafade29481e596a9ec80aceca09442
UHC 癲щ돆誼믭━喩쀬젔B 11101111101001101010110011101011100010011001011111101011111111101001001011101111101001101010110011101010111001111001011111101100101000001001001001000010 efa6aceb8997ebfe92efa6aceae797eca09242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)