To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????TB 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5442
SJIS-WIN 偲辞篠璽篠叱偲汐篠磁篠ヤナ爾偲゙トナノTB 10001110110000111000111010101011100011101100001010001110101000111000111011000010100011101011011010001110110000111000111010101100100011101100001010001110101001011000111011000010110101001100010110001110101000101000111011000011110111101100010011000101110010010101010001000010 8ec38eab8ec28ea38ec28eb68ec38eac8ec28ea58ec2d4c58ea28ec3dec4c5c95442
EUC-JP 偲辞篠璽篠叱偲汐篠磁篠ヤナ爾偲゙トナノTB 10111100110001011011110010101101101111001100010010111100101001011011110011000100101111001011100010111100110001011011110010101110101111001100010010111100101001111011110011000100100011101101010010001110110001011011110010100100101111001100010110001110110111101000111011000100100011101100010110001110110010010101010001000010 bcc5bcadbcc4bca5bcc4bcb8bcc5bcaebcc4bca7bcc48ed48ec5bca4bcc58ede8ec48ec58ec95442
UTF-8 偲辞篠璽篠叱偲汐篠磁篠ヤナ爾偲゙トナノTB 1110010110000001101100101110100010111110100111101110011110101111101000001110011110010010101111011110011110101111101000001110010110001111101100011110010110000001101100101110011010110001100100001110011110101111101000001110011110100011100000011110011110101111101000001110111110111110100101001110111110111110100001011110011110001000101111101110010110000001101100101110111110111110100111101110111110111110100001001110111110111110100001011110111110111110100010010101010001000010 e581b2e8be9ee7afa0e792bde7afa0e58fb1e581b2e6b190e7afa0e7a381e7afa0efbe94efbe85e788bee581b2efbe9eefbe84efbe85efbe895442
UHC ??篠璽篠叱?汐篠磁篠??爾?????TB 001111110011111111100001110001101101111111011110111000011100011011110010111010100011111111100000101100011110000111000110111011011011100011100001110001100011111100111111111011001011001100111111001111110011111100111111001111110101010001000010 3f3fe1c6dfdee1c6f2ea3fe0b1e1c6edb8e1c63f3fecb33f3f3f3f3f5442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)