What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 厓??敖?????}v厓??敖?????}vB 111110101000110100111111001111111001110111000010001111110011111100111111001111110011111101111101011101101111101010001101001111110011111110011101110000100011111100111111001111110011111100111111011111010111011001000010 fa8d3f3f9dc23f3f3f3f3f7d76fa8d3f3f9dc23f3f3f3f3f7d7642
EUC-JP 厓??敖?????}v厓??敖?????}vB 1000111110110100110001110011111100111111110110101100010000111111001111110011111100111111001111110111110101110110100011111011010011000111001111110011111111011010110001000011111100111111001111110011111100111111011111010111011001000010 8fb4c73f3fdac43f3f3f3f3f7d768fb4c73f3fdac43f3f3f3f3f7d7642
UTF-8 厓김츖敖뉓썔連양ㅎ}v厓김츖敖뉓썔連양ㅎ}vB 1110010110001110100100111110101010111001100000001110110010111000100101101110011010010101100101101110101110001001100100111110110010001101100101001110111110100110100110101110110010010110100100011110001110000101100011100111110101110110111001011000111010010011111010101011100110000000111011001011100010010110111001101001010110010110111010111000100110010011111011001000110110010100111011111010011010011010111011001001011010010001111000111000010110001110011111010111011001000010 e58e93eab980ecb896e69596eb8993ec8d94efa69aec9691e3858e7d76e58e93eab980ecb896e69596eb8993ec8d94efa69aec9691e3858e7d7642
UHC 厓김츖敖뉓썔連양ㅎ}v厓김츖敖뉓썔連양ㅎ}vB 1110010011101101101100011110100010101110100100001110011111111001100001111110100010011011100001111110011011100110101111101110011110100100101111100111110101110110111001001110110110110001111010001010111010010000111001111111100110000111111010001001101110000111111001101110011010111110111001111010010010111110011111010111011001000010 e4edb1e8ae90e7f987e89b87e6e6bee7a4be7d76e4edb1e8ae90e7f987e89b87e6e6bee7a4be7d7642
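
The conversion in the table above can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters a charset cannot represent are replaced with `?` (hex `3f`), which matches the runs of `3f` in the rows above; individual byte values for rare characters may still differ from this page's converter.

```python
# Charset labels follow the table; the Python codec names are assumptions.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in convert("あ}vB").items():
        print(label, binary, hexa)
```

Each output line has the same shape as a table row without the character column: the charset label, the bit string, and the hexadecimal string.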

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).