What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????????? | 001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f
SJIS-WIN | 烏??猷??膺?? | 100010010100011100111111001111111001011101010001001111110011111111100100010111100011111100111111 | 89473f3f97513f3fe45e3f3f
EUC-JP | 烏??猷??膺?? | 101100011010100000111111001111111100110110110010001111110011111111100111101111110011111100111111 | b1a83f3fcdb23f3fe7bf3f3f
UTF-8 | 烏띻퀣猷됧뵱膺얠퍥 | 111001111000001110001111111010111001110110111011111011011000000010100011111001111000110010110111111010111001000010100111111010111011010110110001111010001000011010111010111011001001011010100000111011011000110110100101 | e7838feb9dbbed80a3e78cb7eb90a7ebb5b1e886baec96a0ed8da5
UHC | 烏띻퀣猷됧뵱膺얠퍥 | 111010001010000110001101111010101011001110010111111010111010001110001001111001011001010010101111111010111110110010111110111011001011101110011100 | e8a18deab397eba389e594afebecbeecbb9c

SJIS-WIN, EUC-JP: legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
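
The conversion shown in the table can be reproduced with a short script. The following is a minimal Python sketch, not necessarily how this page computes its results. The function name `encode_to_bits` and the `CODECS` mapping are hypothetical. The mapping assumes that Python's `cp932` and `cp949` codecs correspond to SJIS-WIN and UHC. Characters that a charset cannot represent are replaced with "?" (byte 3f), as in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is Python's name for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is Python's name for UHC
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        bits, hex_string = encode_to_bits("烏", codec)
        print(label, bits, hex_string)
```

For the single character 烏, this gives e7838f in UTF-8, 8947 in SJIS-WIN, and b1a8 in EUC-JP, matching the leading bytes of the corresponding rows above.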