To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄λ????矣??永??毅?Ⅴ???永??鎰? 10010110111011111000001111001001001111110011111100111111001111111110000111100001001111110011111110001001011010010011111100111111100010110100001000111111100001110101100000111111001111110011111110001001011010010011111100111111111010000100110000111111 96ef83c93f3f3f3fe1e13f3f89693f3f8b423f87583f3f3f89693f3fe84c3f
EUC-JP 厄λ????矣??永??毅??洧??永??鎰? 1100110011110001101001101100101100111111001111110011111100111111111000101110001100111111001111111011000111001010001111110011111110110101101000110011111100111111100011111100011110110100001111110011111110110001110010100011111100111111111011111010110100111111 ccf1a6cb3f3f3f3fe2e33f3fb1ca3f3fb5a33f3f8fc7b43f3fb1ca3f3fefad3f
UTF-8 厄λ뱶痢믣깷矣명뮊永띠렲毅곻Ⅴ洧븍뼬永띠룊鎰쁁 1110010110001110100001001100111010111011111010111011000110110110111011111010011110100101111010111010111110100011111010101011100110110111111001111001111110100011111010111010101010000101111010111010111010001010111001101011000010111000111010111001110110100000111010111010000010110010111001101010111110000101111010101011001110111011111000101000010110100100111001101011010010100111111010111011100010001101111010111011110010101100111001101011000010111000111010111001110110100000111010111010001110001010111010011000111010110000111011001000000110000001 e58e84cebbebb1b6efa7a5ebafa3eab9b7e79fa3ebaa85ebae8ae6b0b8eb9da0eba0b2e6af85eab3bbe285a4e6b4a7ebb88debbcace6b0b8eb9da0eba38ae98eb0ec8181
UHC 厄λ뱶痢믣깷矣명뮊永띠렲毅곻Ⅴ洧븍뼬永띠룊鎰쁁 11100100111110001010010111101011100100111001110011101100101110001001001011100101100000111010010111101011111110001011100011101101100100101001100011100111101101011011011011101100100011101011111111101011111101101000000111101111101001011011010011101010111110111011101011101011100101101010111111100111101101011011011011101100100011111000100111101100111100001001100001000010 e4f8a5eb939cecb892e583a5ebf8b8ed9298e7b5b6ec8ebfebf681efa5b4eafbbaeb96afe7b5b6ec8f89ecf09842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)