What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 如??絶??鹽??}如??絶??鹽??{^ 100101000100000000111111001111111001000011100010001111110011111111101010011001000011111100111111011111011001010001000000001111110011111110010000111000100011111100111111111010100110010000111111001111110111101101011110 94403f3f90e23f3fea643f3f7d94403f3f90e23f3fea643f3f7b5e
EUC-JP 如??絶??鹽??}如??絶??鹽??{^ 110001111010000100111111001111111100000011100100001111110011111111110011110001010011111100111111011111011100011110100001001111110011111111000000111001000011111100111111111100111100010100111111001111110111101101011110 c7a13f3fc0e43f3ff3c53f3f7dc7a13f3fc0e43f3ff3c53f3f7b5e
UTF-8 如답툓絶뽭뇠鹽븀ㅎ}如답툓絶뽭뇠鹽븀ㅎ{^ 111001011010011010000010111010111000101110110101111011011000100010010011111001111011010110110110111010111011110110101101111010111000011110100000111010011011100110111101111010111011100010000000111000111000010110001110011111011110010110100110100000101110101110001011101101011110110110001000100100111110011110110101101101101110101110111101101011011110101110000111101000001110100110111001101111011110101110111000100000001110001110000101100011100111101101011110 e5a682eb8bb5ed8893e7b5b6ebbdadeb87a0e9b9bdebb880e3858e7de5a682eb8bb5ed8893e7b5b6ebbdadeb87a0e9b9bdebb880e3858e7b5e
UHC 如답툓絶뽭뇠鹽븀ㅎ}如답툓絶뽭뇠鹽븀ㅎ{^ 111001011111110110110100111001001011100010001010111011111011111010010110111010011000011110001000111001111010010010111010111001111010010010111110011111011110010111111101101101001110010010111000100010101110111110111110100101101110100110000111100010001110011110100100101110101110011110100100101111100111101101011110 e5fdb4e4b88aefbe96e98788e7a4bae7a4be7de5fdb4e4b88aefbe96e98788e7a4bae7a4be7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
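
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes the table's charset labels correspond to Python's `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs, and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f), which is
    an assumption about how the table's 3f bytes were produced.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset of CHARSETS, keyed by table label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("如}").items():
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.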