What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 咀坎???蛔?膺?沮瀑?膺?鷹?咀坎?縫?^ 100110011111000010011010101010100011111100111111001111111110010101111011001111111110010001011110001111111001111110011100111000000110010100111111111001000101111000111111100100011110100100111111100110011111000010011010101010100011111110010110010001000011111101011110 99f09aaa3f3f3fe57b3fe45e3f9f9ce0653fe45e3f91e93f99f09aaa3f96443f5e
EUC-JP 咀坎?熢?蛔?膺?沮瀑?膺?鷹?咀坎?縫?^ 1101001011110010110101001010110000111111100011111100101010101011001111111110100111011100001111111110011110111111001111111101110111111100110111111100011000111111111001111011111100111111110000101110101100111111110100101111001011010100101011000011111111001011101001010011111101011110 d2f2d4ac3f8fcaab3fe9dc3fe7bf3fddfcdfc63fe7bf3fc2eb3fd2f2d4ac3fcba53f5e
UTF-8 咀坎렩熢뤈蛔뵘膺뢸沮瀑뵘膺뢸鷹뫘咀坎렩縫진^ 11100101100100101000000011100101100111011000111011101011101000001010100111100111100001101010001011101011101001001000100011101000100110111001010011101011101101011001100011101000100001101011101011101011101000101011100011100110101100101010111011100111100000001001000111101011101101011001100011101000100001101011101011101011101000101011100011101001101101111011100111101011101010111001100011100101100100101000000011100101100111011000111011101011101000001010100111100111101110001010101111101100101001111000010001011110 e59280e59d8eeba0a9e786a2eba488e89b94ebb598e886baeba2b8e6b2aee78091ebb598e886baeba2b8e9b7b9ebab98e59280e59d8eeba0a9e7b8abeca7845e
UHC 咀坎렩熢뤈蛔뵘膺뢸沮瀑뵘膺뢸鷹뫘咀坎렩縫진^ 11101110101110101100101011101100100011101011011111011100111011001000111110111000111111001110111010111010110010101110101111101100101101111101110011101110110000011111100011101110101110101100101011101011111011001011011111011100111010111110110110111000111111001110111010111010110010101110110010001110101101111101110011101110110000011111100001011110 eebacaec8eb7dcec8fb8fceebacaebecb7dceec1f8eebacaebecb7dcebedb8fceebacaec8eb7dceec1f85e
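The conversion shown in the table can be approximated with a short Python sketch. The codec names below (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about which Python codecs correspond to the charsets listed here, and `errors="replace"` is assumed because unencodable characters appear as `?` (hex `3f`) in the table. The helper names `encode_row` and `encode_table` are hypothetical and not part of the original tool.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in encode_table("咀^").items():
        print(name, binary, hexadecimal)
```

For example, `encode_row("^", "utf-8")` returns `("01011110", "5e")`, which matches the trailing `5e` in every row of the table, and `encode_row("咀", "iso-8859-1")` returns the bits and hex of a single `?`, since ISO-8859-1 has no code for that character.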

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).