To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 獄??揖??邑④??λ?艤g?膺??筌 10001101100101100011111100111111100101110100101100111111001111111001011101010111100001110100001100111111001111111000001111001001001111111110010001111110100000101000011100111111111001000101111000111111001111111110001010100011 8d963f3f974b3f3f975787433f3f83c93fe47e82873fe45e3f3fe2a3
EUC-JP 獄??揖??邑???λ?艤g?膺??筌 101110011111011000111111001111111100110110101100001111110011111111001101101110000011111100111111001111111010011011001011001111111110011111011111101000111110011100111111111001111011111100111111001111111110010010100101 b9f63f3fcdac3f3fcdb83f3f3fa6cb3fe7dfa3e73fe7bf3f3fe4a5
UTF-8 獄뷜뫖揖쇔젆邑④뻗若λ갭艤g뛾膺쇰짋筌 1110011110001101100001001110101110110111100111001110101110101011100101101110011010001111100101101110110010000111100101001110110010100000100001101110100110000010100100011110001010010001101000111110101110111011100101111110111110100101101101001100111010111011111010101011000010101101111010001000100110100100111011111011110110000111111010111001101110111110111010001000011010111010111011001000011110110000111011001010011110001011111001111010110110001100 e78d84ebb79cebab96e68f96ec8794eca086e98291e291a3ebbb97efa5b4cebbeab0ade889a4efbd87eb9bbee886baec87b0eca78be7ad8c
UHC 獄뷜뫖揖쇔젆邑④뻗若λ갭艤g뛾膺쇰짋筌 1110100010101011101110101110001010010001101110001110101111100111101111001110010110100000100010011110101111101001101010001110101010111011101110001110010110101110101001011110101110110000101110001110101111111010101000111110011110001101100001001110101111101100101111001110101110100011100101111110111110100111 e8abbae291b8ebe7bce5a089ebe9a8eabbb8e5aea5ebb0b8ebfaa3e78d84ebecbceba397efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)