To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???踰??節??????ι?蟻??億??? 001111110011111100111111111001101111101000111111001111111001000011011111001111110011111100111111001111110011111100111111100000111100011100111111100010110110000100111111001111111000100110101101001111110011111100111111 3f3f3fe6fa3f3f90df3f3f3f3f3f3f83c73f8b613f3f89ad3f3f3f
EUC-JP ???踰??節?????佾ι?蟻??億??? 0011111100111111001111111110110011111100001111110011111111000000111000010011111100111111001111110011111100111111100011111011000011111011101001101100100100111111101101011100001000111111001111111011001010101111001111110011111100111111 3f3f3fecfc3f3fc0e13f3f3f3f3f8fb0fba6c93fb5c23f3fb2af3f3f3f
UTF-8 閱묐갭踰딉쭏節녿겱捻믠뫁佾ι쉬蟻숇쾴億됰뀘溜 1110100110010110101100011110101110101100100100001110101010110000101011011110100010111000101100001110101110010100100010011110110010101101100011111110011110101111100000001110101110000101101111111110101010110010101100011110111110100110101001001110101110101111101000001110101110101011100000011110010010111101101111101100111010111001111011001000100110101100111010001001111110111011111011001000100010000111111011001011111010110100111001011000010010000100111010111001000010110000111010111000000010011000111011111010011110001011 e996b1ebac90eab0ade8b8b0eb9489ecad8fe7af80eb85bfeab2b1efa6a4ebafa0ebab81e4bdbeceb9ec89ace89fbbec8887ecbeb4e58484eb90b0eb8098efa78b
UHC 閱묐갭踰딉쭏節녿겱捻믠뫁佾ι쉬蟻숇쾴億됰뀘溜 1110011011110011100100011110101110110000101110001110101110110010100010101110111110100111100010001110111110111101100001101110101110000001101111011110011011110111100100101110001010010001101001011110110011101011101001011110100110111101101011001110101111111100100110011110101110110010100010101110010111100010100010011110101110000101100100011110101011111110 e6f391ebb0b8ebb28aefa788efbd86eb81bde6f792e291a5eceba5e9bdacebfc99ebb28ae5e289eb8591eafe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)