What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 贓?檣?助?尾旭?贓?檣?助?尾旭?^ 1110011011011001001111111001111011111100001111111000111110010101001111111001010011110110100010001010111000111111111001101101100100111111100111101111110000111111100011111001010100111111100101001111011010001000101011100011111101011110 e6d93f9efc3f8f953f94f688ae3fe6d93f9efc3f8f953f94f688ae3f5e
EUC-JP 贓?檣?助?尾旭?贓?檣?助?尾旭?^ 1110110011011011001111111101110011111110001111111011110111110101001111111100100011111000101100001011000000111111111011001101101100111111110111001111111000111111101111011111010100111111110010001111100010110000101100000011111101011110 ecdb3fdcfe3fbdf53fc8f8b0b03fecdb3fdcfe3fbdf53fc8f8b0b03f5e
UTF-8 贓렩檣렋助累尾旭뒷贓렩檣렋助累尾旭뒬^ 11101000101101001001001111101011101000001010100111100110101010101010001111101011101000001000101111100101100010101010100111101111101001011000111111100101101100001011111011100110100101111010110111101011100100101011011111101000101101001001001111101011101000001010100111100110101010101010001111101011101000001000101111100101100010101010100111101111101001011000111111100101101100001011111011100110100101111010110111101011100100101010110001011110 e8b493eba0a9e6aaa3eba08be58aa9efa58fe5b0bee697adeb92b7e8b493eba0a9e6aaa3eba08be58aa9efa58fe5b0bee697adeb92ac5e
UHC 贓렩檣렋助累尾旭뒷贓렩檣렋助累尾旭뒬^ 11101101111111001000111010110111111011011110101010001110101000101111000010111110110100101110100111011010101011011110100111101111101101011101111011101101111111001000111010110111111011011110101010001110101000101111000010111110110100101110100111011010101011011110100111101111101101011101110001011110 edfc8eb7edea8ea2f0bed2e9daade9efb5deedfc8eb7edea8ea2f0bed2e9daade9efb5dc5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
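
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed equivalents of the labels above, and the function name `encode_all` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is assumed to match the `?` entries in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("あ^").items():
        print(label, bits, hex_string)
```

Results may differ from the table for some characters, because codec implementations do not always agree on vendor extensions or on how they substitute unmappable characters.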