To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???飮??音?????飮??伎逸??兪?? 00111111001111110011111110011111010110100011111100111111100010011011100100111111001111110011111100111111001111111001111101011010001111110011111110001010111010101000100011101101001111110011111110011001011000000011111100111111 3f3f3f9f5a3f3f89b93f3f3f3f3f9f5a3f3f8aea88ed3f3f99603f3f
EUC-JP ???飮??音?????飮??伎逸??兪?? 00111111001111110011111111011101101110110011111100111111101100101011101100111111001111110011111100111111001111111101110110111011001111110011111110110100111011001011000011101111001111110011111111010001110000010011111100111111 3f3f3fddbb3f3fb2bb3f3f3f3f3fddbb3f3fb4ecb0ef3f3fd1c13f3f
UTF-8 凉깅냵飮긷쩂音쎌댅凉깅냵飮긷럳伎逸뜹선兪낆댇 111011111010010110111001111010101011100110000101111010111000001110110101111010011010001110101110111010101011100010110111111011001010100110000010111010011001111110110011111011001000111010001100111010111000110010000101111011111010010110111001111010101011100110000101111010111000001110110101111010011010001110101110111010101011100010110111111010111001111110110011111001001011110010001110111010011000000010111000111010111001110010111001111011001000010010100000111001011000010110101010111010111000001010000110111010111000110010000111 efa5b9eab985eb83b5e9a3aeeab8b7eca982e99fb3ec8e8ceb8c85efa5b9eab985eb83b5e9a3aeeab8b7eb9fb3e4bc8ee980b8eb9cb9ec84a0e585aaeb8286eb8c87
UHC 凉깅냵飮긷쩂音쎌댅凉깅냵飮긷럳伎逸뜹선兪낆댇 1110010110111100101100011110101110000110100001011110101111100110101100011110010110100100100111001110101111100101101111011110110010001000101011111110010110111100101100011110101110000110100001011110101111100110101100011110010110001110100100111101000011101011111011001110111110110110111001011011110010110001111010101110010010000101111011001000100010110001 e5bcb1eb8685ebe6b1e5a49cebe5bdec88afe5bcb1eb8685ebe6b1e58e93d0ebecefb6e5bcb1eae485ec88b1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)