To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鼇??鎰??蟻??畏?????揄?????裕 11101010100001110011111100111111111010000100110000111111001111111000101101100001001111110011111110001000110110000011111100111111001111110011111100111111100111011000100100111111001111110011111100111111001111111001011101010100 ea873f3fe84c3f3f8b613f3f88d83f3f3f3f3f9d893f3f3f3f3f9754
EUC-JP 鼇??鎰??蟻??畏?????揄?????裕 11110011111001110011111100111111111011111010110100111111001111111011010111000010001111110011111110110000110110100011111100111111001111110011111100111111110110011110100100111111001111110011111100111111001111111100110110110101 f3e73f3fefad3f3fb5c23f3fb0da3f3f3f3f3fd9e93f3f3f3f3fcdb5
UTF-8 鼇앸뵃鎰믤슅蟻띿쁺畏븐꼱六쀥슖揄뺥닂捻뚭염裕 111010011011110010000111111011001001010110111000111010111011010110000011111010011000111010110000111010111010111110100100111011001000101010000101111010001001111110111011111010111001110110111111111011001000000110111010111001111001010110001111111010111011100010010000111010101011110010110001111011111010011110010001111011001000000010100101111011001000101010010110111001101000111110000100111010111011101010100101111010111000101110000010111011111010011010100100111010111001101010101101111011001001011110111100111010001010001110010101 e9bc87ec95b8ebb583e98eb0ebafa4ec8a85e89fbbeb9dbfec81bae7958febb890eabcb1efa791ec80a5ec8a96e68f84ebbaa5eb8b82efa6a4eb9aadec97bce8a395
UHC 鼇앸뵃鎰믤슅蟻띿쁺畏븐꼱六쀥슖揄뺥닂捻뚭염裕 1110100010101000100111011110101110010100100010011110110011110000100100101110011010011010100101111110101111111100100011011110110010011000100000011110100011100110101110101110110010000100100010111110101110111011100101111110010110011010101001011110101011110001100101011110110110001000100010111110011011110111100011001110101010111111101100001110101110101110 e8a89deb9489ecf092e69a97ebfc8dec9881e8e6baec848bebbb97e59aa5eaf195ed888be6f78ceabfb0ebae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)