To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??泣??誘щ? 10001001011010010011111100111111100010111000001100111111001111111001011101010101100001001000101100111111 89693f3f8b833f3f9755848b3f
EUC-JP 永??泣??誘щ? 10110001110010100011111100111111101101011110001100111111001111111100110110110110101001111110101100111111 b1ca3f3fb5e33f3fcdb6a7eb3f
UTF-8 永띕냵泣섊독誘щ닚 1110011010110000101110001110101110011101100101011110101110000011101101011110011010110011101000111110110010000100100010101110101110001111100001011110100010101010100110001101000110001001111010111000101110011010 e6b0b8eb9d95eb83b5e6b3a3ec848aeb8f85e8aa98d189eb8b9a
UHC 永띕냵泣섊독誘щ닚 111001111011010110110110111010111000011010000101111010111110100010011000111001111011010110110110111010111010111110101100111010111000100010011100 e7b5b6eb8685ebe898e7b5b6ebafaceb889c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)