To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竪端即誰遜袖巽端続奪造族誰遜 10010010010001111001001001011011100100011010011010010010010011101001000110111011100100011011001110010010010001101001001001011011100100011011000110010010010001001001000110100010100100011011000010010010010011101001000110111011 9247925b91a6924e91bb91b39246925b91b1924491a291b0924e91bb
EUC-JP 竪端即誰遜袖巽端続奪造族誰遜 11000011101010001100001110111100110000101010100011000011101011111100001010111101110000101011010111000011101001111100001110111100110000101011001111000011101001011100001010100100110000101011001011000011101011111100001010111101 c3a8c3bcc2a8c3afc2bdc2b5c3a7c3bcc2b3c3a5c2a4c2b2c3afc2bd
UTF-8 竪端即誰遜袖巽端続奪造族誰遜 111001111010101110101010111001111010101110101111111001011000110110110011111010001010101010110000111010011000000110011100111010001010001010010110111001011011011110111101111001111010101110101111111001111011011010011010111001011010010110101010111010011000000010100000111001101001011110001111111010001010101010110000111010011000000110011100 e7abaae7abafe58db3e8aab0e9819ce8a296e5b7bde7abafe7b69ae5a5aae980a0e6978fe8aab0e9819c
UHC 竪端?誰遜袖巽端?奪造族誰遜 1110001010110101110100111010111000111111111000101100000111100001111000011110001011000000111000011101111011010011101011100011111111110111101011001111000011100011111100001110100111100010110000011110000111100001 e2b5d3ae3fe2c1e1e1e2c0e1ded3ae3ff7acf0e3f0e9e2c1e1e1

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
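
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents of the charsets listed above, and the helper names `to_bit_strings` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("竪").items():
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("竪", "utf-8")` yields the hexadecimal string `e7abaa`, matching the first three bytes of the UTF-8 row above.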