To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??]nf??]n^}Y??]nf??]n^}bE 00111111001111110101110101101110011001100011111100111111010111010110111001011110011111010101100100111111001111110101110101101110011001100011111100111111010111010110111001011110011111010110001001000101 3f3f5d6e663f3f5d6e5e7d593f3f5d6e663f3f5d6e5e7d6245
SJIS-WIN 腺訛]nf腺訛]n^}Y腺訛]nf腺訛]n^}bE 100100010100001011100110011000010101110101101110011001101001000101000010111001100110000101011101011011100101111001111101010110011001000101000010111001100110000101011101011011100110011010010001010000101110011001100001010111010110111001011110011111010110001001000101 9142e6615d6e669142e6615d6e5e7d599142e6615d6e669142e6615d6e5e7d6245
EUC-JP 腺訛]nf腺訛]n^}Y腺訛]nf腺訛]n^}bE 110000011010001111101011110000100101110101101110011001101100000110100011111010111100001001011101011011100101111001111101010110011100000110100011111010111100001001011101011011100110011011000001101000111110101111000010010111010110111001011110011111010110001001000101 c1a3ebc25d6e66c1a3ebc25d6e5e7d59c1a3ebc25d6e66c1a3ebc25d6e5e7d6245
UTF-8 腺訛]nf腺訛]n^}Y腺訛]nf腺訛]n^}bE 1110100010000101101110101110100010101000100110110101110101101110011001101110100010000101101110101110100010101000100110110101110101101110010111100111110101011001111010001000010110111010111010001010100010011011010111010110111001100110111010001000010110111010111010001010100010011011010111010110111001011110011111010110001001000101 e885bae8a89b5d6e66e885bae8a89b5d6e5e7d59e885bae8a89b5d6e66e885bae8a89b5d6e5e7d6245
UHC 腺訛]nf腺訛]n^}Y腺訛]nf腺訛]n^}bE 111000001100110111101000110001010101110101101110011001101110000011001101111010001100010101011101011011100101111001111101010110011110000011001101111010001100010101011101011011100110011011100000110011011110100011000101010111010110111001011110011111010110001001000101 e0cde8c55d6e66e0cde8c55d6e5e7d59e0cde8c55d6e66e0cde8c55d6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)