Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 堰??毅??癒⑥?齬??愉??恂ъ?厭 10001001100000010011111100111111100010110100001000111111001111111001011011111100100001110100010100111111111010101001011100111111001111111001011011111001001111110011111110011100100101101000010010001100001111111000100101111101 89813f3f8b423f3f96fc87453fea973f3f96f93f3f9c96848c3f897d
EUC-JP 堰??毅??癒??齬??愉??恂ъ?厭 101100011110000100111111001111111011010110100011001111110011111111001100111111100011111100111111111100111111011100111111001111111100110011111011001111110011111111010111111101101010011111101100001111111011000111011110 b1e13f3fb5a33f3fccfe3f3ff3f73f3fccfb3f3fd7f6a7ec3fb1de
UTF-8 堰쇨쑴毅싨끽癒⑥뿹齬잙벊愉녑렚恂ъ뜭厭 1110010110100000101100001110110010000111101010001110110010010001101101001110011010101111100001011110110010001011101010001110101110000001101111011110011110011001100100101110001010010001101001011110101110111111101110011110100110111101101011001110110010011110100110011110101110110010100010101110011010000100100010011110101110000101100100011110101110100000100110101110011010000001100000101101000110001010111010111001110010101101111001011000111010101101 e5a0b0ec87a8ec91b4e6af85ec8ba8eb81bde79992e291a5ebbfb9e9bdacec9e99ebb28ae68489eb8591eba09ae68182d18aeb9cade58ead
UHC 堰쇨쑴毅싨끽癒⑥뿹齬잙벊愉녑렚恂ъ뜭厭 1110010111101000101111001110101010111110101010011110101111110110100110101110011010110011101000111110101110101000101010001110110010010111101110011110010111100001100111111110101110010011101011011110101011110000101100111110010110001110101011011110001011100001101011001110110010001101101011011110011011110100 e5e8bceabea9ebf69ae6b3a3eba8a8ec97b9e5e19feb93adeaf0b3e58eade2e1acec8dade6f4

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
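
The same kind of table can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as is the helper name `encode_table`. Characters that a charset cannot represent are replaced with `?` (byte 3f), which appears to be what the ISO-8859-1 row above shows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Each byte is printed as exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.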