To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?依????依?猝^ 00111111100010001100101100111111001111110011111100111111100010001100101100111111111000001100101001011110 3f88cb3f3f3f3f88cb3fe0ca5e
EUC-JP ?依????依?猝^ 00111111101100001100110100111111001111110011111100111111101100001100110100111111111000001100110001011110 3fb0cd3f3f3f3fb0cd3fe0cc5e
UTF-8 렱依렮셸셧렱依렮猝^ 11101011101000001011000111100100101111101001110111101011101000001010111011101100100001011011100011101100100001011010011111101011101000001011000111100100101111101001110111101011101000001010111011100111100011001001110101011110 eba0b1e4be9deba0aeec85b8ec85a7eba0b1e4be9deba0aee78c9d5e
UHC 렱依렮셸셧렱依렮猝^ 10001110101111101110101111101110100011101011101110111100110100001011110011001011100011101011111011101011111011101000111010111011111100001111000101011110 8ebeebee8ebbbcd0bccb8ebeebee8ebbf0f15e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)