To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??儒??馭??????伊??碎?? 001111110011111100111111100010111000001100111111001111111000111011110010001111110011111111101001011001100011111100111111001111110011111100111111001111111000100011001001001111110011111111100001111010100011111100111111 3f3f3f8b833f3f8ef23f3fe9663f3f3f3f3f3f88c93f3fe1ea3f3f
EUC-JP ???泣??儒??馭??????伊??碎?? 001111110011111100111111101101011110001100111111001111111011110011110100001111110011111111110001110001110011111100111111001111110011111100111111001111111011000011001011001111110011111111100010111011000011111100111111 3f3f3fb5e33f3fbcf43f3ff1c73f3f3f3f3f3fb0cb3f3fe2ec3f3f
UTF-8 捻뀁늿泣㎩쳞儒묐젡馭귂딅뮋嶺뚮돆伊됧젆碎좎쑂 111011111010011010100100111010111000000010000001111010111000101010111111111001101011001110100011111000111000111010101001111011001011001110011110111001011000010010010010111010111010110010010000111011001010000010100001111010011010011010101101111010101011011110000010111010111001010010000101111010111010111010001011111011111010011010101011111010111001101010101110111010111000111110000110111001001011110010001010111010111001000010100111111011001010000010000110111001111010001010001110111011001010001010001110111011001001000110000010 efa6a4eb8081eb8abfe6b3a3e38ea9ecb39ee58492ebac90eca0a1e9a6adeab782eb9485ebae8befa6abeb9aaeeb8f86e4bc8aeb90a7eca086e7a28eeca28eec9182
UHC 捻뀁늿泣㎩쳞儒묐젡馭귂딅뮋嶺뚮돆伊됧젆碎좎쑂 1110011011110111101100101110110010001000100010001110101111101000101001111110010110101011100001001110101011100011100100011110101110100000100110101110010111011111100000101101000110001010111010111001001010011001111001111010110110001100111010111000100110010111111011001010010110001001111001011010000010001001111000011110111110100000111011001001110010100010 e6f7b2ec8888ebe8a7e5ab84eae391eba09ae5df82d18aeb9299e7ad8ceb8997eca589e5a089e1efa0ec9ca2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)