To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 梧??汚??梧??釗??節??節ч?節 100011001110011000111111001111111000100110011000001111110011111110001100111001100011111100111111111110111011101100111111001111111001000011011111001111110011111110010000110111111000010010001001001111111001000011011111 8ce63f3f89983f3f8ce63f3ffbbb3f3f90df3f3f90df84893f90df
EUC-JP 梧??汚??梧??釗??節??節ч?節 10111000111010000011111100111111101100011111100000111111001111111011100011101000001111110011111110001111111000111010011000111111001111111100000011100001001111110011111111000000111000011010011111101001001111111100000011100001 b8e83f3fb1f83f3fb8e83f3f8fe3a63f3fc0e13f3fc0e1a7e93fc0e1
UTF-8 梧귨쉠汚억슬梧잌쨰釗녘뮰節띈쾫節ч닎節 1110011010100010101001111110101010110111101010001110110010001001101000001110011010110001100110101110110010010110101101011110110010001010101011001110011010100010101001111110110010011110100011001110110010101000101100001110100110000111100101111110101110000101100110001110101110101110101100001110011110101111100000001110101110011101100010001110110010111110101010111110011110101111100000001101000110000111111010111000101110001110111001111010111110000000 e6a2a7eab7a8ec89a0e6b19aec96b5ec8aace6a2a7ec9e8ceca8b0e98797eb8598ebaeb0e7af80eb9d88ecbeabe7af80d187eb8b8ee7af80
UHC 梧귨쉠汚억슬梧잌쨰釗녘뮰節띈쾫節ч닎節 1110011111111100100000101110111110111101101010101110011111111101101111101110111110111101101111011110011111111100100111111110010110100100100010101110000111110010101100111110100010010010101110011110111110111101101101101110100010110010100000101110111110111101101011001110100110001000100101001110111110111101 e7fc82efbdaae7fdbeefbdbde7fc9fe5a48ae1f2b3e892b9efbdb6e8b282efbdace98894efbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)