To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲щ?誼??銀レ?嚥 11100001100111111000010010001011001111111000101101100010001111110011111110001011111000101000001110001100001111111001101010001011 e19f848b3f8b623f3f8be2838c3f9a8b
EUC-JP 癲щ?誼??銀レ?嚥 11100010101000011010011111101011001111111011010111000011001111110011111110110110111001001010010111101100001111111101001111101011 e2a1a7eb3fb5c33f3fb6e4a5ec3fd3eb
UTF-8 癲щ돃誼썸에銀レ졐嚥 1110011110011001101100101101000110001001111010111000111110000011111010001010101010111100111011001000110110111000111011001001011110010000111010011000101010000000111000111000001110101100111011001010000110010000111001011001101010100101 e799b2d189eb8f83e8aabcec8db8ec9790e98a80e383aceca190e59aa5
UHC 癲щ돃誼썸에銀レ졐嚥 1110111110100110101011001110101110001001100101101110101111111110101111011110011010111111101000011110101111011110101010111110110010100000101111011110011010111111 efa6aceb8996ebfebde6bfa1ebdeabeca0bde6bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)