Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 隲、螳」豎暮ォサ蠖掵隲、螳」豎暮ォサ蠖掉^ 111010001010101110100100111001011010111010100011111001101011000110010101111010011010101110111011111001011011110110011101011111011110100010101011101001001110010110101110101000111110011010110001100101011110100110101011101110111110010110111101100111010111101101011110 e8aba4e5aea3e6b195e9abbbe5bd9d7de8aba4e5aea3e6b195e9abbbe5bd9d7b5e
EUC-JP 隲、螳」豎暮ォサ蠖掵隲、螳」豎暮ォサ蠖掉^ 1111000010101101100011101010010011101010101100001000111010100011111011001011001111001010111010111000111010101011100011101011101111101010101111111101100111011110111100001010110110001110101001001110101010110000100011101010001111101100101100111100101011101011100011101010101110001110101110111110101010111111110110011101110001011110 f0ad8ea4eab08ea3ecb3caeb8eab8ebbeabfd9def0ad8ea4eab08ea3ecb3caeb8eab8ebbeabfd9dc5e
UTF-8 隲、螳」豎暮ォサ蠖掵隲、螳」豎暮ォサ蠖掉^ 11101001100110101011001011101111101111011010010011101000100111101011001111101111101111011010001111101000101100011000111011100110100110101010111011101111101111011010101111101111101111011011101111101000101000001001011011100110100011101011010111101001100110101011001011101111101111011010010011101000100111101011001111101111101111011010001111101000101100011000111011100110100110101010111011101111101111011010101111101111101111011011101111101000101000001001011011100110100011101000100101011110 e99ab2efbda4e89eb3efbda3e8b18ee69aaeefbdabefbdbbe8a096e68eb5e99ab2efbda4e89eb3efbda3e8b18ee69aaeefbdabefbdbbe8a096e68e895e
UHC ??螳??暮??????螳??暮???掉^ 0011111100111111110100111101100100111111001111111101100110111010001111110011111100111111001111110011111100111111110100111101100100111111001111111101100110111010001111110011111100111111110100111111110001011110 3f3fd3d93f3fd9ba3f3f3f3f3f3fd3d93f3fd9ba3f3f3fd3fc5e
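
A conversion like the one in the table can be approximated with Python's built-in codecs. This is only a minimal sketch, not the tool's actual implementation. The codec names on the right (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed to be the closest Python equivalents, and the helper name `bit_strings` is hypothetical. Characters a charset cannot represent are replaced with `?` (hex `3f`), which matches the `3f` runs in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in bit_strings("A^"):
        print(label, binary, hexadecimal)
```

For plain ASCII input such as `A^`, every charset listed here produces the same bytes (`415e`). The rows only diverge once the input contains non-ASCII characters.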

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
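
To illustrate that these two charsets assign different bytes to the same Japanese character, here is a small sketch using Python's `cp932` and `euc_jp` codecs. These are assumed to correspond to SJIS-Win and EUC-JP as used on this page, and the helper name `encode_hex` is hypothetical.

```python
def encode_hex(text, codec):
    """Encode text with the given codec and return the bytes as hex."""
    return text.encode(codec).hex()


if __name__ == "__main__":
    # Hiragana "あ" (U+3042) in the two classic Japanese charsets.
    print("cp932 ", encode_hex("あ", "cp932"))   # 82a0
    print("euc_jp", encode_hex("あ", "euc_jp"))  # a4a2
```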