To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 檍??二??儒??筌??釉f?楡λ?筌 10011110111110000011111100111111100100111111000100111111001111111000111011110010001111110011111111100010101000110011111100111111111001111101011010000010100001100011111110011110101111101000001111001001001111111110001010100011 9ef83f3f93f13f3f8ef23f3fe2a33f3fe7d682863f9ebe83c93fe2a3
EUC-JP 檍??二??儒??筌??釉f?楡λ?筌 11011100111110100011111100111111110001101111001100111111001111111011110011110100001111110011111111100100101001010011111100111111111011101101100010100011111001100011111111011100110000001010011011001011001111111110010010100101 dcfa3f3fc6f33f3fbcf43f3fe4a53f3feed8a3e63fdcc0a6cb3fe4a5
UTF-8 檍용슢二깍쭓儒붽묽筌뗫쪇釉f에楡λ틦筌 1110011010101010100011011110110010011010101010011110110010001010101000101110010010111010100011001110101010111001100011011110110010101101100100111110010110000100100100101110101110110110101111011110101110101100101111011110011110101101100011001110101110010111101010111110110010101010100001111110100110000111100010011110111110111101100001101110110010010111100100001110011010100101101000011100111010111011111011011000101110100110111001111010110110001100 e6aa8dec9aa9ec8aa2e4ba8ceab98decad93e58492ebb6bdebacbde7ad8ceb97abecaa87e98789efbd86ec9790e6a5a1cebbed8ba6e7ad8c
UHC 檍용슢二깍쭓儒붽묽筌뗫쪇釉f에楡λ틦筌 1110010111100101101111111110101110011010101011101110110010100011101100011110111110100111100010111110101011100011100101001110101010111001101100011110111110100111100010111110101110100101100000011110101110111000101000111110011010111111101000011110101011111000101001011110101110111010100100001110111110100111 e5e5bfeb9aaeeca3b1efa78beae394eab9b1efa78beba581ebb8a3e6bfa1eaf8a5ebba90efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)