To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 橈??節??要??獄 1001111011110100001111110011111110010000110111110011111100111111100101110111011000111111001111111000110110010110 9ef43f3f90df3f3f97763f3f8d96
EUC-JP 橈??節??要??獄 1101110011110110001111110011111111000000111000010011111100111111110011011101011100111111001111111011100111110110 dcf63f3fc0e13f3fcdd73f3fb9f6
UTF-8 橈띲굫節쏙슬要쏉숱獄 111001101010100110001000111010111001110110110010111010101011010110101011111001111010111110000000111011001000111110011001111011001000101010101100111010001010011010000001111011001000111110001001111011001000100010110001111001111000110110000100 e6a988eb9db2eab5abe7af80ec8f99ec8aace8a681ec8f89ec88b1e78d84
UHC 橈띲굫節쏙슬要쏉숱獄 1110100011111010100011011110001110000010100100011110111110111101101111011110111110111101101111011110100110101001100110111110111110111101101000101110100010101011 e8fa8de38291efbdbdefbdbde9a99befbda2e8ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)