To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 誰遜形誰遜族誰遜村誰遜造誰遜形誰遜族誰遜村誰遜造^ 10010010010011101001000110111011100011000110000010010010010011101001000110111011100100011011000010010010010011101001000110111011100100011011101010010010010011101001000110111011100100011010001010010010010011101001000110111011100011000110000010010010010011101001000110111011100100011011000010010010010011101001000110111011100100011011101010010010010011101001000110111011100100011010001001011110 924e91bb8c60924e91bb91b0924e91bb91ba924e91bb91a2924e91bb8c60924e91bb91b0924e91bb91ba924e91bb91a25e
EUC-JP 誰遜形誰遜族誰遜村誰遜造誰遜形誰遜族誰遜村誰遜造^ 11000011101011111100001010111101101101111100000111000011101011111100001010111101110000101011001011000011101011111100001010111101110000101011110011000011101011111100001010111101110000101010010011000011101011111100001010111101101101111100000111000011101011111100001010111101110000101011001011000011101011111100001010111101110000101011110011000011101011111100001010111101110000101010010001011110 c3afc2bdb7c1c3afc2bdc2b2c3afc2bdc2bcc3afc2bdc2a4c3afc2bdb7c1c3afc2bdc2b2c3afc2bdc2bcc3afc2bdc2a45e
UTF-8 誰遜形誰遜族誰遜村誰遜造誰遜形誰遜族誰遜村誰遜造^ 11101000101010101011000011101001100000011001110011100101101111011010001011101000101010101011000011101001100000011001110011100110100101111000111111101000101010101011000011101001100000011001110011100110100111011001000111101000101010101011000011101001100000011001110011101001100000001010000011101000101010101011000011101001100000011001110011100101101111011010001011101000101010101011000011101001100000011001110011100110100101111000111111101000101010101011000011101001100000011001110011100110100111011001000111101000101010101011000011101001100000011001110011101001100000001010000001011110 e8aab0e9819ce5bda2e8aab0e9819ce6978fe8aab0e9819ce69d91e8aab0e9819ce980a0e8aab0e9819ce5bda2e8aab0e9819ce6978fe8aab0e9819ce69d91e8aab0e9819ce980a05e
UHC 誰遜形誰遜族誰遜村誰遜造誰遜形誰遜族誰遜村誰遜造^ 11100010110000011110000111100001111110111010000111100010110000011110000111100001111100001110100111100010110000011110000111100001111101011011110111100010110000011110000111100001111100001110001111100010110000011110000111100001111110111010000111100010110000011110000111100001111100001110100111100010110000011110000111100001111101011011110111100010110000011110000111100001111100001110001101011110 e2c1e1e1fba1e2c1e1e1f0e9e2c1e1e1f5bde2c1e1e1f0e3e2c1e1e1fba1e2c1e1e1f0e9e2c1e1e1f5bde2c1e1e1f0e35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)