To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蜈??節b?穩ч?節??堯??松??倭?? 111001011000010100111111001111111001000011011111100000101000001000111111111000100111001010000100100010010011111110010000110111110011111100111111111010101001111100111111001111111000111110111100001111110011111110011000011000000011111100111111 e5853f3f90df82823fe27284893f90df3f3fea9f3f3f8fbc3f3f98603f3f
EUC-JP 蜈??節b?穩ч?節??堯??松??倭?? 111010011110010100111111001111111100000011100001101000111110001000111111111000111101001110100111111010010011111111000000111000010011111100111111111101001010000100111111001111111011111010111110001111110011111111001111110000010011111100111111 e9e53f3fc0e1a3e23fe3d3a7e93fc0e13f3ff4a13f3fbebe3f3fcfc13f3f
UTF-8 蜈좈뜈節b댌穩ч걠節븃쪧堯뗰숲松듣괵倭욆녉 1110100010011100100010001110110010100010100010001110101110011100100010001110011110101111100000001110111110111101100000101110101110001100100011001110011110101001101010011101000110000111111010101011000110100000111001111010111110000000111010111011100010000011111011001010101010100111111001011010000010101111111010111001011110110000111011001000100010110010111001101001110110111110111010111001001110100011111010101011010010110101111001011000000010101101111011001001101010000110111010111000010110001001 e89c88eca288eb9c88e7af80efbd82eb8c8ce7a9a9d187eab1a0e7af80ebb883ecaaa7e5a0afeb97b0ec88b2e69dbeeb93a3eab4b5e580adec9a86eb8589
UHC 蜈좈뜈節b댌穩ч걠節븃쪧堯뗰숲松듣괵倭욆녉 111010001010010110100000111010011000110110001011111011111011110110100011111000101000100010110101111010001011000110101100111010011000000110001001111011111011110110111010111010001010010110100000111010001110101110001011111011111011110110100011111000011110011010110101111010001011000110101100111010001101111010011110111010001000011010111111 e8a5a0e98d8befbda3e288b5e8b1ace98189efbdbae8a5a0e8eb8befbda3e1e6b5e8b1ace8de9ee886bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)