To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN ?櫻存ア??ご???櫻存ア??ご??E 001111111001111101001110100100011011011010000011010000010011111100111111100000101011001000111111001111110011111110011111010011101001000110110110100000110100000100111111001111111000001010110010001111110011111101000101 3f9f4e91b683413f3f82b23f3f3f9f4e91b683413f3f82b23f3f45
EUC-JP ?櫻存ア??ご???櫻存ア??ご??E 001111111101110110101111110000101011100010100101101000100011111100111111101001001011010000111111001111110011111111011101101011111100001010111000101001011010001000111111001111111010010010110100001111110011111101000101 3fddafc2b8a5a23f3fa4b43f3f3fddafc2b8a5a23f3fa4b43f3f45
UTF-8 룵櫻存ア룶殺ご룵햊룵櫻存ア룶殺ご룵햍E 11101011101000111011010111100110101010111011101111100101101011011001100011100011100000101010001011101011101000111011011011101111101001011011000011100011100000011001010011101011101000111011010111101101100101101000101011101011101000111011010111100110101010111011101111100101101011011001100011100011100000101010001011101011101000111011011011101111101001011011000011100011100000011001010011101011101000111011010111101101100101101000110101000101 eba3b5e6abbbe5ad98e382a2eba3b6efa5b0e38194eba3b5ed968aeba3b5e6abbbe5ad98e382a2eba3b6efa5b0e38194eba3b5ed968d45
UHC 룵櫻存ア룶殺ご룵햊룵櫻存ア룶殺ご룵햍E 10001111101010101110010110100001111100001110110110101011101000101000111110101011111000011110110110101010101101001000111110101010110000010101100110001111101010101110010110100001111100001110110110101011101000101000111110101011111000011110110110101010101101001000111110101010110000010110001001000101 8faae5a1f0edaba28fabe1edaab48faac1598faae5a1f0edaba28fabe1edaab48faac16245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)