To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN ?暴??器萄?賀賦?暴??器萄?賀鳧E 0011111110010110010111000011111100111111100010101110110110010011101110000011111110001001111010101001010110001010001111111001011001011100001111110011111110001010111011011001001110111000001111111000100111101010111010011110100001000101 3f965c3f3f8aed93b83f89ea958a3f965c3f3f8aed93b83f89eae9e845
EUC-JP ?暴??器萄?賀賦?暴??器萄?賀鳧E 0011111111001011101111010011111100111111101101001110111111000110101110100011111110110010111011001100100111101010001111111100101110111101001111110011111110110100111011111100011010111010001111111011001011101100111100101110101001000101 3fcbbd3f3fb4efc6ba3fb2ecc9ea3fcbbd3f3fb4efc6ba3fb2ecf2ea45
UTF-8 뤋暴첂샘器萄뤋賀賦뤋暴첂샘器萄뤋賀鳧E 11101011101001001000101111100110100110101011010011101100101100101000001011101100100000111001100011100101100110011010100011101000100100001000010011101011101001001000101111101000101100111000000011101000101100111010011011101011101001001000101111100110100110101011010011101100101100101000001011101100100000111001100011100101100110011010100011101000100100001000010011101011101001001000101111101000101100111000000011101001101100111010011101000101 eba48be69ab4ecb282ec8398e599a8e89084eba48be8b380e8b3a6eba48be69ab4ecb282ec8398e599a8e89084eba48be8b380e9b3a745
UHC 뤋暴첂샘器萄뤋賀賦뤋暴첂샘器萄뤋賀鳧E 10001111101110111111100011101100101010101000111110111011111110011101000011101111110101001010110010001111101110111111100111000101110111011011011110001111101110111111100011101100101010101000111110111011111110011101000011101111110101001010110010001111101110111111100111000101110111011100000001000101 8fbbf8ecaa8fbbf9d0efd4ac8fbbf9c5ddb78fbbf8ecaa8fbbf9d0efd4ac8fbbf9c5ddc045

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
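
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) and the helper name `encode_all` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which resembles the `3f` bytes in the table, but results for some inputs may differ from the tool's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("暴E").items():
        print(label, bits, hex_string)
```

Calling `encode_all` on a single character makes it easy to compare how many bytes each charset needs for it and which charsets cannot represent it at all.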