What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????G?????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110100011100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f473f3f3f3f3f3f3f3f3f3f
SJIS-WIN 篠セナ璽偲治篠湿篠竺G篠セナ璽篠識篠湿篠識 10001110110000101011111011000101100011101010001110001110110000111000111010100001100011101100001010001110101111001000111011000010100011101011000101000111100011101100001010111110110001011000111010100011100011101100001010001110101011111000111011000010100011101011110010001110110000101000111010101111 8ec2bec58ea38ec38ea18ec28ebc8ec28eb1478ec2bec58ea38ec28eaf8ec28ebc8ec28eaf
EUC-JP 篠セナ璽偲治篠湿篠竺G篠セナ璽篠識篠湿篠識 1011110011000100100011101011111010001110110001011011110010100101101111001100010110111100101000111011110011000100101111001011111010111100110001001011110010110011010001111011110011000100100011101011111010001110110001011011110010100101101111001100010010111100101100011011110011000100101111001011111010111100110001001011110010110001 bcc48ebe8ec5bca5bcc5bca3bcc4bcbebcc4bcb347bcc48ebe8ec5bca5bcc4bcb1bcc4bcbebcc4bcb1
UTF-8 篠セナ璽偲治篠湿篠竺G篠セナ璽篠識篠湿篠識 11100111101011111010000011101111101111011011111011101111101111101000010111100111100100101011110111100101100000011011001011100110101100101011101111100111101011111010000011100110101110011011111111100111101011111010000011100111101010111011101001000111111001111010111110100000111011111011110110111110111011111011111010000101111001111001001010111101111001111010111110100000111010001010110110011000111001111010111110100000111001101011100110111111111001111010111110100000111010001010110110011000 e7afa0efbdbeefbe85e792bde581b2e6b2bbe7afa0e6b9bfe7afa0e7abba47e7afa0efbdbeefbe85e792bde7afa0e8ad98e7afa0e6b9bfe7afa0e8ad98
UHC 篠??璽?治篠?篠竺G篠??璽篠識篠?篠識 11100001110001100011111100111111110111111101111000111111111101101011110111100001110001100011111111100001110001101111010111100111010001111110000111000110001111110011111111011111110111101110000111000110111000111101101111100001110001100011111111100001110001101110001111011011 e1c63f3fdfde3ff6bde1c63fe1c6f5e747e1c63f3fdfdee1c6e3dbe1c63fe1c6e3db

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win, i.e. CP932) and UNIX (EUC-JP).
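
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the names `CODECS`, `to_bit_strings`, and `conversion_table` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the behavior visible in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the
    # target charset cannot represent.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def conversion_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in conversion_table("篠G").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits so that the binary and hexadecimal columns always describe the same byte sequence.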