To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 歟??踰???≪?狎??揖?????余??諭 1001111101100010001111110011111111100110111110100011111100111111001111111000000111100001001111111110000010111110001111110011111110010111010010110011111100111111001111110011111100111111100101110101110100111111001111111001011101000000 9f623f3fe6fa3f3f3f81e13fe0be3f3f974b3f3f3f3f3f975d3f3f9740
EUC-JP 歟??踰??洹≪?狎??揖?????余??諭 11011101110000110011111100111111111011001111110000111111001111111000111111000111101110101010001011100011001111111110000011000000001111110011111111001101101011000011111100111111001111110011111100111111110011011011111000111111001111111100110110100001 ddc33f3fecfc3f3f8fc7baa2e33fe0c03f3fcdac3f3f3f3f3fcdbe3f3fcda1
UTF-8 歟㏓뀛踰ㅵ푻洹≪뒭狎녿쪋揖곁뭐琉밸닲余쒓꼍諭 111001101010110110011111111000111000111110010011111010111000000010011011111010001011100010110000111000111000010110110101111011011001000110111011111001101011010010111001111000101000100110101010111010111001001010101101111001111000101110001110111010111000010110111111111011001010101010001011111001101000111110010110111010101011001110000001111010111010110110010000111011111010011110001100111010111011000010111000111010111000101110110010111001001011110110011001111011001001001010010011111010101011110010001101111010001010101110101101 e6ad9fe38f93eb809be8b8b0e385b5ed91bbe6b4b9e289aaeb92ade78b8eeb85bfecaa8be68f96eab381ebad90efa78cebb0b8eb8bb2e4bd99ec9293eabc8de8abad
UHC 歟㏓뀛踰ㅵ푻洹≪뒭狎녿쪋揖곁뭐琉밸닲余쒓꼍諭 1110011010100010101001111110101110000101100101001110101110110010101001001110010110111110100001111110101010110111101000011110110010001010101001101110010011100100100001101110101110100101100001011110101111100111101100001110011110111001101110011110101110100100101110011110101110001000101010001110010111111001100111001110101010110010101111011110101110110001 e6a2a7eb8594ebb2a4e5be87eab7a1ec8aa6e4e486eba585ebe7b0e7b9b9eba4b9eb88a8e5f99ceab2bdebb1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)