To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竪他揃誰遜他辿測俗奪造族誰遜則誰遜側 100100100100011110010001101111001001000110110101100100100100111010010001101110111001000110111100100100100100100010010001101010101001000110101101100100100100010010010001101000101001000110110000100100100100111010010001101110111001000110100101100100100100111010010001101110111001000110100100 924791bc91b5924e91bb91bc924891aa91ad924491a291b0924e91bb91a5924e91bb91a4
EUC-JP 竪他揃誰遜他辿測俗奪造族誰遜則誰遜側 110000111010100011000010101111101100001010110111110000111010111111000010101111011100001010111110110000111010100111000010101011001100001010101111110000111010010111000010101001001100001010110010110000111010111111000010101111011100001010100111110000111010111111000010101111011100001010100110 c3a8c2bec2b7c3afc2bdc2bec3a9c2acc2afc3a5c2a4c2b2c3afc2bdc2a7c3afc2bdc2a6
UTF-8 竪他揃誰遜他辿測俗奪造族誰遜則誰遜側 111001111010101110101010111001001011101110010110111001101000111110000011111010001010101010110000111010011000000110011100111001001011101110010110111010001011111010111111111001101011100010101100111001001011111110010111111001011010010110101010111010011000000010100000111001101001011110001111111010001010101010110000111010011000000110011100111001011000100110000111111010001010101010110000111010011000000110011100111001011000000110110100 e7abaae4bb96e68f83e8aab0e9819ce4bb96e8bebfe6b8ace4bf97e5a5aae980a0e6978fe8aab0e9819ce58987e8aab0e9819ce581b4
UHC 竪他?誰遜他?測俗奪造族誰遜則誰遜側 11100010101101011111011011100010001111111110001011000001111000011110000111110110111000100011111111110110101101001110000111010100111101111010110011110000111000111111000011101001111000101100000111100001111000011111011011001110111000101100000111100001111000011111011010110000 e2b5f6e23fe2c1e1e1f6e23ff6b4e1d4f7acf0e3f0e9e2c1e1e1f6cee2c1e1e1f6b0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)