To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????±??????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111111011000100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3fb13f3f3f3f3f3f3f
SJIS-WIN 筌??認??????レ±油??音?シ筌 111000101010001100111111001111111001010001000110001111110011111100111111001111110011111100111111100000111000110010000001011111011001011011111011001111110011111110001001101110010011111110000011010101101110001010100011 e2a33f3f94463f3f3f3f3f3f838c817d96fb3f3f89b93f8356e2a3
EUC-JP 筌??認??洧???レ±油??音?シ筌 1110010010100101001111110011111111000111101001110011111100111111100011111100011110110100001111110011111100111111101001011110110010100001110111101100110011111101001111110011111110110010101110110011111110100101101101111110010010100101 e4a53f3fc7a73f3f8fc7b43f3f3fa5eca1deccfd3f3fb2bb3fa5b7e4a5
UTF-8 筌뚮뱷認뗰ℓ洧댁뿉曆レ±油륅쭓音섏シ筌 1110011110101101100011001110101110011010101011101110101110110001101101111110100010101010100011011110101110010111101100001110001010000100100100111110011010110100101001111110101110001100100000011110101110111111100010011110111110100110100010111110001110000011101011001100001010110001111001101011001010111001111010111010010110000101111011001010110110010011111010011001111110110011111011001000010010001111111000111000001010110111111001111010110110001100 e7ad8ceb9aaeebb1b7e8aa8deb97b0e28493e6b4a7eb8c81ebbf89efa68be383acc2b1e6b2b9eba585ecad93e99fb3ec848fe382b7e7ad8c
UHC 筌뚮뱷認뗰ℓ洧댁뿉曆レ±油륅쭓音섏シ筌 1110111110100111100011001110101110010011100111011110110011100011100010111110111110100111101001001110101011111011101101001110110010010111100100001110011010110111101010111110110010100001101111101110101011111010100011111110111110100111100010111110101111100101100110001110110010101011101101111110111110100111 efa78ceb939dece38befa7a4eafbb4ec9790e6b7abeca1beeafa8fefa78bebe598ecabb7efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)