What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????h?????????????? 0011111100111111001111110011111100111111001111110011111100111111011010000011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??ゅ?丹?ビ?h??ゅ?丹?ビ???ゅ??コ 00111111001111111000001011100011001111111001001001001111001111111000001101110010001111110110100000111111001111111000001011100011001111111001001001001111001111111000001101110010001111110011111100111111100000101110001100111111001111111000001101010010 3f3f82e33f924f3f83723f683f3f82e33f924f3f83723f3f3f82e33f3f8352
EUC-JP ??ゅ?丹?ビ?h??ゅ?丹?ビ???ゅ??コ 00111111001111111010010011100101001111111100001110110000001111111010010111010011001111110110100000111111001111111010010011100101001111111100001110110000001111111010010111010011001111110011111100111111101001001110010100111111001111111010010110110011 3f3fa4e53fc3b03fa5d33f683f3fa4e53fc3b03fa5d33f3f3fa4e53f3fa5b3
UTF-8 룵퓦ゅ룶丹춳ビ∼h룵퓦ゅ룶丹춳ビ∼룵퓦ゅ룴횕コ 11101011101000111011010111101101100100111010011011100011100000101000010111101011101000111011011011100100101110001011100111101100101101101011001111100011100000111001001111100010100010001011110001101000111010111010001110110101111011011001001110100110111000111000001010000101111010111010001110110110111001001011100010111001111011001011011010110011111000111000001110010011111000101000100010111100111010111010001110110101111011011001001110100110111000111000001010000101111010111010001110110100111011011001101010010101111000111000001010110011 eba3b5ed93a6e38285eba3b6e4b8b9ecb6b3e38393e288bc68eba3b5ed93a6e38285eba3b6e4b8b9ecb6b3e38393e288bceba3b5ed93a6e38285eba3b4ed9a95e382b3
UHC 룵퓦ゅ룶丹춳ビ∼h룵퓦ゅ룶丹춳ビ∼룵퓦ゅ룴횕コ 100011111010101010111111100011111010101011100101100011111010101111010011101000011010110110001111101010111101001110100001101011010110100010001111101010101011111110001111101010101110010110001111101010111101001110100001101011011000111110101011110100111010000110101101100011111010101010111111100011111010101011100101100011111010100111000011100011111010101110110011 8faabf8faae58fabd3a1ad8fabd3a1ad688faabf8faae58fabd3a1ad8fabd3a1ad8faabf8faae58fa9c38fabb3

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
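
The conversion shown in the table can be approximated offline with a minimal Python sketch. The function name `to_bit_strings` is hypothetical, and the codec names (`cp932` for SJIS-Win, `cp949` for UHC) are Python's own aliases, not necessarily what this page uses internally. Replacing unmappable characters with "?" (byte 3f) is an assumption, chosen because it matches the 3f bytes in the table above.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset; return (binary string, hex string).

    Characters the charset cannot represent are replaced with '?' (0x3f).
    This replacement policy is an assumption, not the page's documented behavior.
    """
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Python codec names assumed to correspond to the charsets in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("コ", codec)
    print(label, binary, hexadecimal)
```

For example, `to_bit_strings("h", "utf-8")` returns `("01101000", "68")`, which agrees with the byte 68 shown for "h" in every row of the table.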