To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????\}????????\{^ 001111110011111100111111001111110011111100111111001111110011111101011100011111010011111100111111001111110011111100111111001111110011111100111111010111000111101101011110 3f3f3f3f3f3f3f3f5c7d3f3f3f3f3f3f3f3f5c7b5e
SJIS-WIN 痔鉀ミ闔アノ゙\}痔鉀ミ闔アノ゙\{^ 1111001010101011100011101010010011111011110001011101000011101000100011101011000111001001110111100101110001111101111100101010101110001110101001001111101111000101110100001110100010001110101100011100100111011110010111000111101101011110 f2ab8ea4fbc5d0e88eb1c9de5c7df2ab8ea4fbc5d0e88eb1c9de5c7b5e
EUC-JP ?痔鉀ミ闔アノ゙\}?痔鉀ミ闔アノ゙\{^ 00111111101111001010011010001111111000111101100010001110110100001110111111101110100011101011000110001110110010011000111011011110010111000111110100111111101111001010011010001111111000111101100010001110110100001110111111101110100011101011000110001110110010011000111011011110010111000111101101011110 3fbca68fe3d88ed0efee8eb18ec98ede5c7d3fbca68fe3d88ed0efee8eb18ec98ede5c7b5e
UTF-8 痔鉀ミ闔アノ゙\}痔鉀ミ闔アノ゙\{^ 1110111010000111101000101110011110010111100101001110100110001001100000001110111110111110100100001110100110010111100101001110111110111101101100011110111110111110100010011110111110111110100111100101110001111101111011101000011110100010111001111001011110010100111010011000100110000000111011111011111010010000111010011001011110010100111011111011110110110001111011111011111010001001111011111011111010011110010111000111101101011110 ee87a2e79794e98980efbe90e99794efbdb1efbe89efbe9e5c7dee87a2e79794e98980efbe90e99794efbdb1efbe89efbe9e5c7b5e
UHC ?痔鉀?闔???\}?痔鉀?闔???\{^ 001111111111011011000000110010111010010100111111111110011110111100111111001111110011111101011100011111010011111111110110110000001100101110100101001111111111100111101111001111110011111100111111010111000111101101011110 3ff6c0cba53ff9ef3f3f3f5c7d3ff6c0cba53ff9ef3f3f3f5c7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)