To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 擁?????荏??[擁?????荏??[^ 10010111011010010011111100111111001111110011111100111111100010010110000000111111001111110101101110010111011010010011111100111111001111110011111100111111100010010110000000111111001111110101101101011110 97693f3f3f3f3f89603f3f5b97693f3f3f3f3f89603f3f5b5e
EUC-JP 擁?????荏??[擁?????荏??[^ 11001101110010100011111100111111001111110011111100111111101100011100000100111111001111110101101111001101110010100011111100111111001111110011111100111111101100011100000100111111001111110101101101011110 cdca3f3f3f3f3fb1c13f3f5bcdca3f3f3f3f3fb1c13f3f5b5e
UTF-8 擁숉쓽淋낁텤荏섑쉫[擁숉쓽淋낁텤荏섑쉫[^ 111001101001001110000001111011001000100010001001111011001001001110111101111011111010011110110101111010111000001010000001111011011000010110100100111010001000110110001111111011001000010010010001111011001000100110101011010110111110011010010011100000011110110010001000100010011110110010010011101111011110111110100111101101011110101110000010100000011110110110000101101001001110100010001101100011111110110010000100100100011110110010001001101010110101101101011110 e69381ec8889ec93bdefa7b5eb8281ed85a4e88d8fec8491ec89ab5be69381ec8889ec93bdefa7b5eb8281ed85a4e88d8fec8491ec89ab5b5e
UHC 擁숉쓽淋낁텤荏섑쉫[擁숉쓽淋낁텤荏섑쉫[^ 111010001011011010011001111011011001110110011000111011001111100010000101111010001011011010011001111011001111101110011000111011011001101010000101010110111110100010110110100110011110110110011101100110001110110011111000100001011110100010110110100110011110110011111011100110001110110110011010100001010101101101011110 e8b699ed9d98ecf885e8b699ecfb98ed9a855be8b699ed9d98ecf885e8b699ecfb98ed9a855b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)