What bit string is a character (or characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????GB 00111111001111110011111100111111001111110011111100111111001111110100011101000010 3f3f3f3f3f3f3f3f4742
SJIS-WIN 篠耳偲室篠爾偲而GB 100011101100001010001110101010001000111011000011100011101011101010001110110000101000111010100010100011101100001110001110101001110100011101000010 8ec28ea88ec38eba8ec28ea28ec38ea74742
EUC-JP 篠耳偲室篠爾偲而GB 101111001100010010111100101010101011110011000101101111001011110010111100110001001011110010100100101111001100010110111100101010010100011101000010 bcc4bcaabcc5bcbcbcc4bca4bcc5bca94742
UTF-8 篠耳偲室篠爾偲而GB 1110011110101111101000001110100010000000101100111110010110000001101100101110010110101110101001001110011110101111101000001110011110001000101111101110010110000001101100101110100010000000100011000100011101000010 e7afa0e880b3e581b2e5aea4e7afa0e788bee581b2e8808c4742
UHC 篠耳?室篠爾?而GB 11100001110001101110110010111100001111111110001111111000111000011100011011101100101100110011111111101100101110110100011101000010 e1c6ecbc3fe3f8e1c6ecb33fecbb4742
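
The table above can be approximated offline with a short Python sketch. This is not the converter's own code. The function names `encode_row` and `build_table` are hypothetical. The mapping from the table's charset labels to Python codec names is also an assumption, in particular `cp932` for SJIS-WIN and `cp949` for UHC. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches what the ISO-8859-1 and UHC rows show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-WIN corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    data = text.encode(codec, errors="replace")
    # Decode again to show what survived the round trip.
    shown = data.decode(codec)
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def build_table(text):
    """Encode the same text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}
```

Calling `build_table` with the input string returns one `(character, binary, hexadecimal)` tuple per charset, in the same column order as the table.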

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).