To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????W}???????????W{^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010111011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 筌??湲???????W}筌??湲???????W{^ 11100010101000110011111100111111100111111101000100111111001111110011111100111111001111110011111100111111010101110111110111100010101000110011111100111111100111111101000100111111001111110011111100111111001111110011111100111111010101110111101101011110 e2a33f3f9fd13f3f3f3f3f3f3f577de2a33f3f9fd13f3f3f3f3f3f3f577b5e
EUC-JP 筌??湲???????W}筌??湲???????W{^ 11100100101001010011111100111111110111101101001100111111001111110011111100111111001111110011111100111111010101110111110111100100101001010011111100111111110111101101001100111111001111110011111100111111001111110011111100111111010101110111101101011110 e4a53f3fded33f3f3f3f3f3f3f577de4a53f3fded33f3f3f3f3f3f3f577b5e
UTF-8 筌좎뇴湲븀뼇溜묈떀泥펗W}筌좎뇴湲븀뼇溜묈떀泥펗W{^ 1110011110101101100011001110110010100010100011101110101110000111101101001110011010111001101100101110101110111000100000001110101110111100100001111110111110100111100010111110101110101100100010001110101110010110100000001110111110100111101000111110110110001110100101110101011101111101111001111010110110001100111011001010001010001110111010111000011110110100111001101011100110110010111010111011100010000000111010111011110010000111111011111010011110001011111010111010110010001000111010111001011010000000111011111010011110100011111011011000111010010111010101110111101101011110 e7ad8ceca28eeb87b4e6b9b2ebb880ebbc87efa78bebac88eb9680efa7a3ed8e97577de7ad8ceca28eeb87b4e6b9b2ebb880ebbc87efa78bebac88eb9680efa7a3ed8e97577b5e
UHC 筌좎뇴湲븀뼇溜묈떀泥펗W}筌좎뇴湲븀뼇溜묈떀泥펗W{^ 11101111101001111010000011101100100001111001100011101010101110001011101011100111100101101001000111101010111111101001000111100101100010111001011011101100101100101011110001101011010101110111110111101111101001111010000011101100100001111001100011101010101110001011101011100111100101101001000111101010111111101001000111100101100010111001011011101100101100101011110001101011010101110111101101011110 efa7a0ec8798eab8bae79691eafe91e58b96ecb2bc6b577defa7a0ec8798eab8bae79691eafe91e58b96ecb2bc6b577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)