To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鄒夂イ狗柄鄒壽ーエ蛛取エェ閼セ 111001111011111010011010111001111011001010001011111001111001010110111111111001111011111010011010111001101011000010110100111001011000000110001110111001101011010010101010111010001000010010111110 e7be9ae7b28be795bfe7be9ae6b0b4e5818ee6b4aae884be
EUC-JP 鄒夂イ狗柄鄒壽ーエ蛛取エェ閼セ 111011101100000011010100111010011000111010110010101101101110100111001010110000011110111011000000110101001110100010001110101100001000111010110100111010011110000110111100111010001000111010110100100011101010101011101111111001001000111010111110 eec0d4e98eb2b6e9cac1eec0d4e88eb08eb4e9e1bce88eb48eaaefe48ebe
UTF-8 鄒夂イ狗柄鄒壽ーエ蛛取エェ閼セ 111010011000010010010010111001011010010010000010111011111011110110110010111001111000101110010111111001101001111110000100111010011000010010010010111001011010001110111101111011111011110110110000111011111011110110110100111010001001101110011011111001011000111110010110111011111011110110110100111011111011110110101010111010011001011010111100111011111011110110111110 e98492e5a482efbdb2e78b97e69f84e98492e5a3bdefbdb0efbdb4e89b9be58f96efbdb4efbdaae996bcefbdbe
UHC 鄒??狗柄鄒壽??蛛取??閼? 1111010111011011001111110011111111001111101101111101110010110111111101011101101111100001111110000011111100111111111100011100100011110110101000100011111100111111111001001101100100111111 f5db3f3fcfb7dcb7f5dbe1f83f3ff1c8f6a23f3fe4d93f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)