To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 翁??竭誌媛????竭誌才?翁??竭誌莊?^ 100010011010010100111111001111111110001010010001100011101000111110010101010100010011111100111111001111110011111111100010100100011000111010001111100011011100101100111111100010011010010100111111001111111110001010010001100011101000111111100100101101010011111101011110 89a53f3fe2918e8f95513f3f3f3fe2918e8f8dcb3f89a53f3fe2918e8fe4b53f5e
EUC-JP 翁??竭誌媛????竭誌才?翁??竭誌莊?^ 101100101010011100111111001111111110001111110001101110111110111111001001101100100011111100111111001111110011111111100011111100011011101111101111101110101100110100111111101100101010011100111111001111111110001111110001101110111110111111101000101101110011111101011110 b2a73f3fe3f1bbefc9b23f3f3f3fe3f1bbefbacd3fb2a73f3fe3f1bbefe8b73f5e
UTF-8 翁댓렱竭誌媛뷸렪댓렱竭誌才렱翁댓렱竭誌莊렱^ 11100111101111111000000111101011100011001001001111101011101000001011000111100111101010111010110111101000101010101000110011100101101010101001101111101011101101111011100011101011101000001010101011101011100011001001001111101011101000001011000111100111101010111010110111101000101010101000110011100110100010011000110111101011101000001011000111100111101111111000000111101011100011001001001111101011101000001011000111100111101010111010110111101000101010101000110011101000100011101000101011101011101000001011000101011110 e7bf81eb8c93eba0b1e7abade8aa8ce5aa9bebb7b8eba0aaeb8c93eba0b1e7abade8aa8ce6898deba0b1e7bf81eb8c93eba0b1e7abade8aa8ce88e8aeba0b15e
UHC 翁댓렱竭誌媛뷸렪댓렱竭誌才렱翁댓렱竭誌莊렱^ 11101000101110101011010011110001100011101011111011001010111001101111001010111100111010101011000010111010111001101000111010111000101101001111000110001110101111101100101011100110111100101011110011101110101001101000111010111110111010001011101010110100111100011000111010111110110010101110011011110010101111001110110111110110100011101011111001011110 e8bab4f18ebecae6f2bceab0bae68eb8b4f18ebecae6f2bceea68ebee8bab4f18ebecae6f2bcedf68ebe5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)