To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????意??泣???????????揖 00111111001111110011111100111111001111110011111110001000110100110011111100111111100010111000001100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111001011101001011 3f3f3f3f3f3f88d33f3f8b833f3f3f3f3f3f3f3f3f3f3f974b
EUC-JP ??????意??泣???????????揖 00111111001111110011111100111111001111110011111110110000110101010011111100111111101101011110001100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111100110110101100 3f3f3f3f3f3fb0d53f3fb5e33f3f3f3f3f3f3f3f3f3f3fcdac
UTF-8 溜삳젍溜삥뿼意살뵽泣곷졎溜롫졎嶪쇨섹溜싲졋揖 111011111010011110001011111011001000001010110011111011001010000010001101111011111010011110001011111011001000001010100101111010111011111110111100111001101000010010001111111011001000001010110100111010111011010110111101111001101011001110100011111010101011001110110111111011001010000110001110111011111010011110001011111010111010000110101011111011001010000110001110111001011011011010101010111011001000011110101000111011001000010010111001111011111010011110001011111011001000101110110010111011001010000110001011111001101000111110010110 efa78bec82b3eca08defa78bec82a5ebbfbce6848fec82b4ebb5bde6b3a3eab3b7eca18eefa78beba1abeca18ee5b6aaec87a8ec84b9efa78bec8bb2eca18be68f96
UHC 溜삳젍溜삥뿼意살뵽泣곷졎溜롫졎嶪쇨섹溜싲졋揖 1110101011111110101110111110101110100000100011101110101011111110101110111110011010010111101111001110101111110010101110111110110010010100101110111110101111101000100000011110101110100000101110111110101011111110100011101110101110100000101110111110010111110101101111001110101010111100101111011110101011111110100110101110101110100000101110101110101111100111 eafebbeba08eeafebbe697bcebf2bbec94bbebe881eba0bbeafe8eeba0bbe5f5bceabcbdeafe9aeba0baebe7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)