To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 筌??湲???????筌??湲???????^ 111000101010001100111111001111111001111111010001001111110011111100111111001111110011111100111111001111111110001010100011001111110011111110011111110100010011111100111111001111110011111100111111001111110011111101011110 e2a33f3f9fd13f3f3f3f3f3f3fe2a33f3f9fd13f3f3f3f3f3f3f5e
EUC-JP 筌??湲???????筌??湲???????^ 111001001010010100111111001111111101111011010011001111110011111100111111001111110011111100111111001111111110010010100101001111110011111111011110110100110011111100111111001111110011111100111111001111110011111101011110 e4a53f3fded33f3f3f3f3f3f3fe4a53f3fded33f3f3f3f3f3f3f5e
UTF-8 筌좎뇴湲븀뼇溜묈떀泥퍭筌좎뇴湲븀뼇溜묈떀泥퍭^ 11100111101011011000110011101100101000101000111011101011100001111011010011100110101110011011001011101011101110001000000011101011101111001000011111101111101001111000101111101011101011001000100011101011100101101000000011101111101001111010001111101101100011011010110111100111101011011000110011101100101000101000111011101011100001111011010011100110101110011011001011101011101110001000000011101011101111001000011111101111101001111000101111101011101011001000100011101011100101101000000011101111101001111010001111101101100011011010110101011110 e7ad8ceca28eeb87b4e6b9b2ebb880ebbc87efa78bebac88eb9680efa7a3ed8dade7ad8ceca28eeb87b4e6b9b2ebb880ebbc87efa78bebac88eb9680efa7a3ed8dad5e
UHC 筌좎뇴湲븀뼇溜묈떀泥퍭筌좎뇴湲븀뼇溜묈떀泥퍭^ 111011111010011110100000111011001000011110011000111010101011100010111010111001111001011010010001111010101111111010010001111001011000101110010110111011001011001010111100010001001110111110100111101000001110110010000111100110001110101010111000101110101110011110010110100100011110101011111110100100011110010110001011100101101110110010110010101111000100010001011110 efa7a0ec8798eab8bae79691eafe91e58b96ecb2bc44efa7a0ec8798eab8bae79691eafe91e58b96ecb2bc445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)