To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 褥??倚?┃攸??^ 1110010111110001001111110011111110011000110111110011111110000100101010111001110110111111001111110011111101011110 e5f13f3f98df3f84ab9dbf3f3f5e
EUC-JP 褥??倚?┃攸??^ 1110101011110011001111110011111111010000111000010011111110101000101011011101101011000001001111110011111101011110 eaf33f3fd0e13fa8addac13f3f5e
UTF-8 褥띠쥋倚딉┃攸곸넞^ 11101000101001001010010111101011100111011010000011101100101001011000101111100101100000001001101011101011100101001000100111100010100101001000001111100110100101001011100011101010101100111011100011101011100001001001111001011110 e8a4a5eb9da0eca58be5809aeb9489e29483e694b8eab3b8eb849e5e
UHC 褥띠쥋倚딉┃攸곸넞^ 11101001101100111011011011101100101000101000010011101011111011111000101011101111101001101010110111101010111100101000000111101100100001101010001001011110 e9b3b6eca284ebef8aefa6adeaf281ec86a25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)