Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???魏??音??癰???鸚??伊???⑦? 00111111001111110011111111101001101100000011111100111111100010011011100100111111001111111110000110011110001111110011111100111111111010100101111100111111001111111000100011001001001111110011111100111111100001110100011000111111 3f3f3fe9b03f3f89b93f3fe19e3f3f3fea5f3f3f88c93f3f3f87463f
EUC-JP ???魏??音??癰???鸚??伊????? 001111110011111100111111111100101011001000111111001111111011001010111011001111110011111111100001111111100011111100111111001111111111001111000000001111110011111110110000110010110011111100111111001111110011111100111111 3f3f3ff2b23f3fb2bb3f3fe1fe3f3f3ff3c03f3fb0cb3f3f3f3f3f
UTF-8 捻뀀맦魏긷츦音쎌춷癰궽쇱젷鸚쒓퍔伊싷쭓戮⑦뒅 111011111010011010100100111010111000000010000000111010111010011110100110111010011010110110001111111010101011100010110111111011001011100010100110111010011001111110110011111011001000111010001100111011001011011010110111111001111001100110110000111010101011011010111101111011001000011110110001111011001010000010110111111010011011100010011010111011001001001010010011111011011000110110010100111001001011110010001010111011001000101110110111111011001010110110010011111011111010011110010010111000101001000110100110111010111001001010000101 efa6a4eb8080eba7a6e9ad8feab8b7ecb8a6e99fb3ec8e8cecb6b7e799b0eab6bdec87b1eca0b7e9b89aec9293ed8d94e4bc8aec8bb7ecad93efa792e291a6eb9285
UHC 捻뀀맦魏긷츦音쎌춷癰궽쇱젷鸚쒓퍔伊싷쭓戮⑦뒅 1110011011110111101100101110101110010000101011111110101011100000101100011110010110101110100111001110101111100101101111011110110010101101100100111110100010111001100000101100111010111100111011001010000010101011111001011010010010011100111010101011101110001011111011001010010110011010111011111010011110001011111010111011110110101000111011011000101010000011 e6f7b2eb90afeae0b1e5ae9cebe5bdecad93e8b982cebceca0abe5a49ceabb8beca59aefa78bebbda8ed8a83

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
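
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the page's actual implementation: the helper name encode_table is hypothetical, and the mapping from the page's charset labels to Python codec names (in particular SJIS-WIN to cp932 and UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f visible in the ISO-8859-1 row above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = []
    for label, codec in CHARSETS.items():
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


for label, bits, hex_string in encode_table("音"):
    print(label, bits, hex_string)
```

For the single character 音 this should give 3f for ISO-8859-1 (not representable), 89b9 for SJIS-WIN, b2bb for EUC-JP, and e99fb3 for UTF-8, consistent with the byte sequences for 音 in the rows above.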