To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??泣?ぜ釉??嚴ъ?吟??恂レ?筌??? 1110100111110010001111110011111110001011100000110011111110000010101110101110011111010110001111110011111110011010100011101000010010001100001111111000101111100001001111110011111110011100100101101000001110001100001111111110001010100011001111110011111100111111 e9f23f3f8b833f82bae7d63f3f9a8e848c3f8be13f3f9c96838c3fe2a33f3f3f
EUC-JP 鶯??泣?ぜ釉??嚴ъ?吟??恂レ?筌??彛 11110010111101000011111100111111101101011110001100111111101001001011110011101110110110000011111100111111110100111110111010100111111011000011111110110110111000110011111100111111110101111111011010100101111011000011111111100100101001010011111100111111100011111011110011111010 f2f43f3fb5e33fa4bceed83f3fd3eea7ec3fb6e33f3fd7f6a5ec3fe4a53f3f8fbcfa
UTF-8 鶯볤쑴泣쒑ぜ釉앹쵁嚴ъ쥙吟믥솾恂レ돇筌뉛퐦彛 1110100110110110101011111110101110110011101001001110110010010001101101001110011010110011101000111110110010010010100100011110001110000001100111001110100110000111100010011110110010010101101110011110110010110101100000011110010110011010101101001101000110001010111011001010010110011001111001011001000010011111111010111010111110100101111011001000011010111110111001101000000110000010111000111000001110101100111010111000111110000111111001111010110110001100111010111000100110011011111011011001000010100110111001011011110110011011 e9b6afebb3a4ec91b4e6b3a3ec9291e3819ce98789ec95b9ecb581e59ab4d18aeca599e5909febafa5ec86bee68182e383aceb8f87e7ad8ceb899bed90a6e5bd9b
UHC 鶯볤쑴泣쒑ぜ釉앹쵁嚴ъ쥙吟믥솾恂レ돇筌뉛퐦彛 1110010110100011100100111110101010111110101010011110101111101000100111001110100010101010101111001110101110111000100111011110110010101100100000111110010111110001101011001110110010100010100011101110101111100001100100101110011110011001101100101110001011100001101010111110110010001001100110001110111110100111100001111110111110111101100011111110110010101101 e5a393eabea9ebe89ce8aabcebb89decac83e5f1aceca28eebe192e799b2e2e1abec8998efa787efbd8fecad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)