To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲щ?侑わ?怨??繞???э?怨??阿 11100001100111111000010010001011001111111001100011010000100000101110110100111111100010011000010100111111001111111110001110000101001111110011111100111111100001001000111100111111100010011000010100111111001111111000100010100010 e19f848b3f98d082ed3f89853f3fe3853f3f3f848f3f89853f3f88a2
EUC-JP 癲щ?侑わ?怨??繞???э?怨??阿 11100010101000011010011111101011001111111101000011010010101001001110111100111111101100011110010100111111001111111110010111100101001111110011111100111111101001111110111100111111101100011110010100111111001111111011000010100100 e2a1a7eb3fd0d2a4ef3fb1e53f3fe5e53f3f3fa7ef3fb1e53f3fb0a4
UTF-8 癲щ뎽侑わ쭓怨뺤졆繞섏슜鱗э쭓怨뺤젋阿 11100111100110011011001011010001100010011110101110001110101111011110010010111110100100011110001110000010100011111110110010101101100100111110011010000000101010001110101110111010101001001110110010100001100001101110011110111001100111101110110010000100100011111110110010001010100111001110111110100111101100101101000110001101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010001011111010011001100010111111 e799b2d189eb8ebde4be91e3828fecad93e680a8ebbaa4eca186e7b99eec848fec8a9cefa7b2d18decad93e680a8ebbaa4eca08be998bf
UHC 癲щ뎽侑わ쭓怨뺤졆繞섏슜鱗э쭓怨뺤젋阿 1110111110100110101011001110101110001001100100001110101011100010101010101110111110100111100010111110101010110011100101011110110010100000101101111110100110100100100110001110110010011010101010011110110011100111101011001110111110100111100010111110101010110011100101011110110010100000100011001110010010111001 efa6aceb8990eae2aaefa78beab395eca0b7e9a498ec9aa9ece7acefa78beab395eca08ce4b9

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
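
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the function name `encode_in_charsets` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with `?` (0x3f), which would account for the runs of `3f` in the ISO-8859-1 row.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_in_charsets(text):
    """Return {label: (binary_string, hex_string)} for each charset."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_in_charsets("あ").items():
        print(label, bits, hex_string)
```

The same character yields a different byte sequence in each charset, which is why text decoded with the wrong charset shows up as unrelated characters or question marks.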