To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 哀?????循??繹??魏??擬????? 1000100010100011001111110011111100111111001111110011111110001111011110100011111100111111111000111000100000111111001111111110100110110000001111110011111110001011010110110011111100111111001111110011111100111111 88a33f3f3f3f3f8f7a3f3fe3883f3fe9b03f3f8b5b3f3f3f3f3f
EUC-JP 哀?????循??繹??魏??擬????? 1011000010100101001111110011111100111111001111110011111110111101110110110011111100111111111001011110100000111111001111111111001010110010001111110011111110110101101111000011111100111111001111110011111100111111 b0a53f3f3f3f3fbddb3f3fe5e83f3ff2b23f3fb5bc3f3f3f3f3f
UTF-8 哀얜챶理롧댚循덈닱繹먮굞魏좈뒽擬뉕데閱고굷 111001011001001110000000111011001001011010011100111011001011000110110110111011111010011110100100111010111010000110100111111010111000110010011010111001011011111010101010111010111000110110001000111010111000101110110001111001111011100110111001111010111010100010101110111010101011010110011110111010011010110110001111111011001010001010001000111010111001001010111101111001101001001110101100111010111000100110010101111010111000110110110000111010011001011010110001111010101011001110100000111010101011010110110111 e59380ec969cecb1b6efa7a4eba1a7eb8c9ae5beaaeb8d88eb8bb1e7b9b9eba8aeeab59ee9ad8feca288eb92bde693aceb8995eb8db0e996b1eab3a0eab5b7
UHC 哀얜챶理롧댚循덈닱繹먮굞魏좈뒽擬뉕데閱고굷 111001001110111010111110111010111010101010000011111011001011010110001110111001111000100010111110111000101110000010001000111010111000100010100111111001101011101010010000111010111000001010000110111010101110000010100000111010011000101010110011111010111111010010000111111010101011010110100101111001101111001110110000111011011000001010010110 e4eebeebaa83ecb58ee788bee2e088eb88a7e6ba90eb8286eae0a0e98ab3ebf487eab5a5e6f3b0ed8296

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)