Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????k}?????????k{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110101101111101001111110011111100111111001111110011111100111111001111110011111100111111011010110111101101011110 3f3f3f3f3f3f3f3f3f6b7d3f3f3f3f3f3f3f3f3f6b7b5e
SJIS-WIN 倭??搖?????k}倭??搖?????k{^ 100110000110000000111111001111111001110110001010001111110011111100111111001111110011111101101011011111011001100001100000001111110011111110011101100010100011111100111111001111110011111100111111011010110111101101011110 98603f3f9d8a3f3f3f3f3f6b7d98603f3f9d8a3f3f3f3f3f6b7b5e
EUC-JP 倭??搖??旿??k}倭??搖??旿??k{^ 11001111110000010011111100111111110110011110101000111111001111111000111111000001111101000011111100111111011010110111110111001111110000010011111100111111110110011110101000111111001111111000111111000001111101000011111100111111011010110111101101011110 cfc13f3fd9ea3f3f8fc1f43f3f6b7dcfc13f3fd9ea3f3f8fc1f43f3f6b7b5e
UTF-8 倭욑스搖㏆쉿旿랃쉑k}倭욑스搖㏆쉿旿랃쉑k{^ 1110010110000000101011011110110010011010100100011110110010001010101001001110011010010000100101101110001110001111100001101110110010001001101111111110011010010111101111111110101110011110100000111110110010001001100100010110101101111101111001011000000010101101111011001001101010010001111011001000101010100100111001101001000010010110111000111000111110000110111011001000100110111111111001101001011110111111111010111001111010000011111011001000100110010001011010110111101101011110 e580adec9a91ec8aa4e69096e38f86ec89bfe697bfeb9e83ec89916b7de580adec9a91ec8aa4e69096e38f86ec89bfe697bfeb9e83ec89916b7b5e
UHC 倭욑스搖㏆쉿旿랃쉑k}倭욑스搖㏆쉿旿랃쉑k{^ 1110100011011110100111101110111110111101101110101110100011110100101001111110111110111101101100101110011111111010100011011110111110111101101001110110101101111101111010001101111010011110111011111011110110111010111010001111010010100111111011111011110110110010111001111111101010001101111011111011110110100111011010110111101101011110 e8de9eefbdbae8f4a7efbdb2e7fa8defbda76b7de8de9eefbdbae8f4a7efbdb2e7fa8defbda76b7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
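
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the implementation behind this page. The mapping from the charset labels to Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) is an assumption, as are the names CHARSETS, encode_row, and print_table. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become "?" (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def print_table(text):
    """Print one line per charset: label, binary bit string, hex bit string."""
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_row(text, codec)
        print(label, binary, hexadecimal)


if __name__ == "__main__":
    print_table("倭k")
```

For the input "倭k", the UTF-8 line ends in e580ad6b and the ISO-8859-1 line in 3f6b, because ISO-8859-1 has no code point for 倭. Python's codec tables may differ from this page's converter for some characters, so results for other inputs are not guaranteed to match the table.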