To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 渦??擁?????}渦??擁?????{^ 10001001010100010011111100111111100101110110100100111111001111110011111100111111001111110111110110001001010100010011111100111111100101110110100100111111001111110011111100111111001111110111101101011110 89513f3f97693f3f3f3f3f7d89513f3f97693f3f3f3f3f7b5e
EUC-JP 渦??擁?????}渦??擁?????{^ 10110001101100100011111100111111110011011100101000111111001111110011111100111111001111110111110110110001101100100011111100111111110011011100101000111111001111110011111100111111001111110111101101011110 b1b23f3fcdca3f3f3f3f3f7db1b23f3fcdca3f3f3f3f3f7b5e
UTF-8 渦욕뎴擁녕궘銳볞퓱}渦욕뎴擁녕궘銳볞퓱{^ 111001101011100010100110111011001001101010010101111010111000111010110100111001101001001110000001111010111000010110010101111010101011011010011000111010011000101010110011111010111011001110011110111011011001001110110001011111011110011010111000101001101110110010011010100101011110101110001110101101001110011010010011100000011110101110000101100101011110101010110110100110001110100110001010101100111110101110110011100111101110110110010011101100010111101101011110 e6b8a6ec9a95eb8eb4e69381eb8595eab698e98ab3ebb39eed93b17de6b8a6ec9a95eb8eb4e69381eb8595eab698e98ab3ebb39eed93b17b5e
UHC 渦욕뎴擁녕궘銳볞퓱}渦욕뎴擁녕궘銳볞퓱{^ 111010001011111010111111111001011000100110000111111010001011011010110011111001111000001010101101111001111110010110010011111001001011111110010111011111011110100010111110101111111110010110001001100001111110100010110110101100111110011110000010101011011110011111100101100100111110010010111111100101110111101101011110 e8bebfe58987e8b6b3e782ade7e593e4bf977de8bebfe58987e8b6b3e782ade7e593e4bf977b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)