What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 荼逸集荐件洲荼逸集蔗 1110010010110110100010001110110110001111010101111110010010100110100011001000111110001111010001101110010010110110100010001110110110001111010101111110010011110010 e4b688ed8f57e4a68c8f8f46e4b688ed8f57e4f2
EUC-JP 荼逸集荐件洲荼逸集蔗 1110100010111000101100001110111110111101101110001110100010101000101101111110111110111101101001111110100010111000101100001110111110111101101110001110100011110100 e8b8b0efbdb8e8a8b7efbda7e8b8b0efbdb8e8f4
UTF-8 荼逸集荐件洲荼逸集蔗 111010001000110110111100111010011000000010111000111010011001101110000110111010001000110110010000111001001011101110110110111001101011010010110010111010001000110110111100111010011000000010111000111010011001101110000110111010001001010010010111 e88dbce980b8e99b86e88d90e4bbb6e6b4b2e88dbce980b8e99b86e89497
UHC ?逸集?件洲?逸集蔗 0011111111101100111011111111001110100010001111111100101111101100111100011011110100111111111011001110111111110011101000101110110110111101 3feceff3a23fcbecf1bd3feceff3a2edbd

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
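
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is what the ISO-8859-1 row above shows. Python's codecs may differ from this page in edge cases.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot represent.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    # Decoding the bytes again shows what survived the round trip.
    return raw.decode(codec), binary, raw.hex()


def convert(text):
    """Build one table row per charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexa) in convert("あ").items():
        print(label, shown, bits, hexa)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.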