To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????F?????????????? 001111110011111100111111001111110011111100111111010001100011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f463f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竪遜造誰遜足F巽端損脱則束誰遜促誰遜捉竪湛 1001001001000111100100011011101110010001101000101001001001001110100100011011101110010001101010110100011010010010010001101001001001011011100100011011100110010010010001011001000110100101100100011010100110010010010011101001000110111011100100011010001110010010010011101001000110111011100100011010100010010010010001111001001001011000 924791bb91a2924e91bb91ab469246925b91b9924591a591a9924e91bb91a3924e91bb91a892479258
EUC-JP 竪遜造誰遜足F巽端損脱則束誰遜促誰遜捉竪湛 1100001110101000110000101011110111000010101001001100001110101111110000101011110111000010101011010100011011000011101001111100001110111100110000101011101111000011101001101100001010100111110000101010101111000011101011111100001010111101110000101010010111000011101011111100001010111101110000101010101011000011101010001100001110111001 c3a8c2bdc2a4c3afc2bdc2ad46c3a7c3bcc2bbc3a6c2a7c2abc3afc2bdc2a5c3afc2bdc2aac3a8c3b9
UTF-8 竪遜造誰遜足F巽端損脱則束誰遜促誰遜捉竪湛 11100111101010111010101011101001100000011001110011101001100000001010000011101000101010101011000011101001100000011001110011101000101101101011001101000110111001011011011110111101111001111010101110101111111001101001000010001101111010001000010010110001111001011000100110000111111001101001110110011111111010001010101010110000111010011000000110011100111001001011111110000011111010001010101010110000111010011000000110011100111001101000110110001001111001111010101110101010111001101011100110011011 e7abaae9819ce980a0e8aab0e9819ce8b6b346e5b7bde7abafe6908de884b1e58987e69d9fe8aab0e9819ce4bf83e8aab0e9819ce68d89e7abaae6b99b
UHC 竪遜造誰遜足F巽端損?則束誰遜促誰遜捉竪湛 11100010101101011110000111100001111100001110001111100010110000011110000111100001111100001110101101000110111000011101111011010011101011101110000111011111001111111111011011001110111000011101011011100010110000011110000111100001111101011011010111100010110000011110000111100001111100111011010111100010101101011101001111000000 e2b5e1e1f0e3e2c1e1e1f0eb46e1ded3aee1df3ff6cee1d6e2c1e1e1f5b5e2c1e1e1f3b5e2b5d3c0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)