To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 瘟?ⅹ倭?ぐ秧??}瘟?ⅹ倭?ぐ秧??{^ 11100001100010010011111111111010010010011001100001100000001111111000001010101110111000100101111000111111001111110111110111100001100010010011111111111010010010011001100001100000001111111000001010101110111000100101111000111111001111110111101101011110 e1893ffa4998603f82aee25e3f3f7de1893ffa4998603f82aee25e3f3f7b5e
EUC-JP 瘟??倭?ぐ秧??}瘟??倭?ぐ秧??{^ 1110000111101001001111110011111111001111110000010011111110100100101100001110001110111111001111110011111101111101111000011110100100111111001111111100111111000001001111111010010010110000111000111011111100111111001111110111101101011110 e1e93f3fcfc13fa4b0e3bf3f3f7de1e93f3fcfc13fa4b0e3bf3f3f7b5e
UTF-8 瘟룟ⅹ倭좄ぐ秧녔릍}瘟룟ⅹ倭좄ぐ秧녔릍{^ 111001111001100010011111111010111010001110011111111000101000010110111001111001011000000010101101111011001010001010000100111000111000000110010000111001111010011110100111111010111000010110010100111010111010011010001101011111011110011110011000100111111110101110100011100111111110001010000101101110011110010110000000101011011110110010100010100001001110001110000001100100001110011110100111101001111110101110000101100101001110101110100110100011010111101101011110 e7989feba39fe285b9e580adeca284e38190e7a7a7eb8594eba68d7de7989feba39fe285b9e580adeca284e38190e7a7a7eb8594eba68d7b5e
UHC 瘟룟ⅹ倭좄ぐ秧녔릍}瘟룟ⅹ倭좄ぐ秧녔릍{^ 111010001011000010110111111001011010010110101010111010001101111010100000111010001010101010110000111001001110101110110011111001101011100010101100011111011110100010110000101101111110010110100101101010101110100011011110101000001110100010101010101100001110010011101011101100111110011010111000101011000111101101011110 e8b0b7e5a5aae8dea0e8aab0e4ebb3e6b8ac7de8b0b7e5a5aae8dea0e8aab0e4ebb3e6b8ac7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)