What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?泣f?循?????竊??節??? 1110000110011111100000111000101100111111100010111000001110000010100001100011111110001111011110100011111100111111001111110011111100111111111000101000011000111111001111111001000011011111001111110011111100111111 e19f838b3f8b8382863f8f7a3f3f3f3f3fe2863f3f90df3f3f3f
EUC-JP 癲ル?泣f?循?Œ???竊??節??孼 111000101010000110100101111010110011111110110101111000111010001111100110001111111011110111011011001111111000111110101001101011010011111100111111001111111110001111100110001111110011111111000000111000010011111100111111100011111011101011000011 e2a1a5eb3fb5e3a3e63fbddb3f8fa9ad3f3f3fe3e63f3fc0e13f3f8fbac3
UTF-8 癲ル슢泣f쾮循놁Œ捻믡꺃竊믦돳節뗪턂孼 1110011110011001101100101110001110000011101010111110110010001010101000101110011010110011101000111110111110111101100001101110110010111110101011101110010110111110101010101110101110000110100000011100010110010010111011111010011010100100111010111010111110100001111010101011101010000011111001111010101110001010111010111010111110100110111010111000111110110011111001111010111110000000111010111001011110101010111011011000010010000010111001011010110110111100 e799b2e383abec8aa2e6b3a3efbd86ecbeaee5beaaeb8681c592efa6a4ebafa1eaba83e7ab8aebafa6eb8fb3e7af80eb97aaed8482e5adbc
UHC 癲ル슢泣f쾮循놁Œ捻믡꺃竊믦돳節뗪턂孼 1110111110100110101010111110101110011010101011101110101111101000101000111110011010110010100001011110001011100000100001101110110010101000101010111110011011110111100100101110001110000011101011001110111110111100100100101110100010001001101101101110111110111101100010111110101010110101100111101110010111101101 efa6abeb9aaeebe8a3e6b285e2e086eca8abe6f792e383acefbc92e889b6efbd8beab59ee5ed

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
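
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: the `CODECS` mapping and the `to_bit_strings` helper are hypothetical names, and the Python codecs chosen (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed to be close equivalents whose mappings may differ from the tool's in edge cases. Characters that a charset cannot represent are replaced with "?", as in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
# These are illustrative choices, not confirmed to match the original tool.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

A replaced character is encoded as the single byte 0x3f (binary 00111111), which is why the ISO-8859-1 row above consists entirely of repeated 3f bytes.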