Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?夭?漿??漿??雰?夭?漿??漿??雰B 0011111110011010111011100011111110011111111101110011111100111111100111111111011100111111001111111001010110110101001111111001101011101110001111111001111111110111001111110011111110011111111101110011111100111111100101011011010101000010 3f9aee3f9ff73f3f9ff73f3f95b53f9aee3f9ff73f3f9ff73f3f95b542
EUC-JP ?夭?漿??漿??雰?夭?漿??漿??雰B 0011111111010100111100000011111111011110111110010011111100111111110111101111100100111111001111111100101010110111001111111101010011110000001111111101111011111001001111110011111111011110111110010011111100111111110010101011011101000010 3fd4f03fdef93f3fdef93f3fcab73fd4f03fdef93f3fdef93f3fcab742
UTF-8 뤚夭비漿쨴쿰漿쫷퀚雰뤚夭비漿쨴쿰漿쫷퀚雰B 11101011101001001001101011100101101001001010110111101011101110011000010011100110101111001011111111101100101010001011010011101100101111111011000011100110101111001011111111101100101010111011011111101101100000001001101011101001100110111011000011101011101001001001101011100101101001001010110111101011101110011000010011100110101111001011111111101100101010001011010011101100101111111011000011100110101111001011111111101100101010111011011111101101100000001001101011101001100110111011000001000010 eba49ae5a4adebb984e6bcbfeca8b4ecbfb0e6bcbfecabb7ed809ae99bb0eba49ae5a4adebb984e6bcbfeca8b4ecbfb0e6bcbfecabb7ed809ae99bb042
UHC 뤚夭비漿쨴쿰漿쫷퀚雰뤚夭비漿쨴쿰漿쫷퀚雰B 1000111111001001111010001110110010111010111100011110110111101100101001001000111011000100111100011110110111101100101001101000111010110011100011101101110111010100100011111100100111101000111011001011101011110001111011011110110010100100100011101100010011110001111011011110110010100110100011101011001110001110110111011101010001000010 8fc9e8ecbaf1edeca48ec4f1edeca68eb38eddd48fc9e8ecbaf1edeca48ec4f1edeca68eb38eddd442

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
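
The conversion shown in the table can be roughly reproduced offline. The following is a minimal sketch in Python, not the tool's actual implementation. The codec names on the right-hand side of `CHARSETS` are assumptions: they are Python's closest equivalents to the charset labels in the table and may differ from the tool's own mappings for a few characters. The function name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced by `?` (byte 3f), which matches the runs of 3f in the ISO-8859-1 row.

```python
# Table label -> Python codec name (assumed closest equivalents).
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("雰B"):
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.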